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BOX PATENT APPLICATION 
Assistant Commissioner for Patents 
Washington, D.C. 20231 

Sir: 

Transmitted herewith for filing is the patent application of 

Inventor(s): Y. Tom Tang, Chenghua Liu, Ping Zhou, Vinod Asundi, Feiyan Ren, Qing A. Zhao, Jie 
Zhang, Jian-Rui Wang, Tom Wehrman, Radoje T. Drmanac 

Title: NOVEL NUCLEIC ACIDS AND POLYPEPTIDES 

1 . Type of application 

E3 This is a new application for a 
iEl Utility patent. 
□ Design patent. 

This is a continuation-in-part application of prior application no. 09/574,454 filed 
May 19, 2000, Attorney Docket No. 789CIP, which is a continuation-in-part 
application of prior application no. 09/519,705 filed March 07, 2000, Attorney 
Docket No. 789. 

2. Application Papers Enclosed 

1 Title Page 

1 1 3 Pages Of Specification (excluding Claims, Abstract, Drawings & Sequence Listing) 

4 Page(s) of Claims 

1 Page(s) of Abstract 

0 Sheet(s) of Drawings (Figs. X-X) □ Formal □ Informal 

1 02 Page(s) of Sequence Listing 



CERTIFICATION UNDER 37 CFR 1.10 



I hereby certify that this Patent Application Transmittal and the documents referred to as enclosed therewith are bemg deposited with the United States 
Postal Service on September 19, 2000, in an envelope addressed to the Assistant Commissioner for Patents, Washington, D.C. 20231 utilizing the Express 
Mail Post Office to Addressee" service of the United States Postal Service under Mailing Label No. EK916750871US 



Leslie A. Mo$i 
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3. Oath or Declaration 



□ Enclosed 

□ Executed by (check all applicable boxes) 

□ Inventor(s) 

□ Legal representative of inventors(s) (37 CFR 1 .42 or 1 .43) 

□ Joint inventor or person showing a proprietary interest on behalf of 
inventor who refused to sign or cannot be reached 

□ The petition required by 37 CFR 1 .47 and the statement required by 37 
CFR 1 .47 are enclosed. See Item 5D below for fee. 



IE1 Unexecuted - the undersigned attorney or agent is authorized to file this 

application on behalf of the applicant(s). An executed declaration will follow. 



4. Additional Papers Enclosed 

□ Preliminary Amendment 

□ Information Disclosure Statement 

□ Declaration of Biological Deposit 

ESI Computer readable copy of sequence listing containing nucleotide and/or amino 
acid sequence 

13 Statement Under 37 CFR § 1 .821 

Kl Paper copy of sequence listing identical to computer copy (102 pages) 

□ Microfiche computer program 

M Verified statement claiming small entity status under 37 CFR 1 .9 and 1 .27 

□ Associate Power of Attorney 

□ Verified translation of a non-English patent application 
IHI Return receipt postcard 

□ Other 



5, Priority Applications Under 35 USC 1 19 



Certified copies of applications from which priority under 35 USC 1 19 is claimed are 
listed below and 

□ are attached. 

□ will follow. 
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6. Filing Fee Calculation (37 CFR 1.16) 
A. El Utility Application 



CLAIMS AS FILED 


- INCLUDING PRELIMINARY A1V 


IENDMENT (D7 ANY) 


' \ \-'' '"'""if 






SMALL ENTITY 


OTHER THA 

ENT] 


N A SMALL 
1TY 




NO. FILED 


NO. EXTRA 


RATE 


FEE 


RATE 


FEE 


BASIC FEE 








$345.00 




$690.00 


TOTAL 


30-20 


= 10 


X 9 = 


$90.00 


X18 = 


$0.00 


INDEP. 


3-3 


= 0 


X39 = 


$0.00 


X78 = 


$0.00 


IE First Present 


ation of Multiple Dependent Claim 


+ 130 = 


$130.00 


+ 260 = 


$0.00 


FILING FEE: 

- 


$565.00 


OR 


$0.00 



B. □ Design Application ($155.00/$310.00) Filing Fee: $ 

C. □ Plant Application ($240.00/$480.00) Filing Fee: $ 

D. Other fees 

□ Recording Assignment [Fee - $40.00 per assignment] $ 

□ Other $ 

TOTAL FEES $ 565.00 
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7. 



Method of Payments of Fees 



□ 



Enclosed check 



Charge Deposit Account No. 501 169. A duplicate copy of this 
transmittal is enclosed 



□ 



Not enclosed 



8. 



Deposit Account and Refund Authorization 



The Commissioner is hereby authorized to charge payment of any additional fees due or 
credit any overpayment to Deposit Account No. 501 169. A duplicate copy of this 
transmittal is enclosed. 

Please refund any overpayment to Hyseq, Inc. at the address below. 

Please direct all future correspondence to Leslie A. Mooi at the address below. 



Respectfully submitted, 



Date: September 19, 2000 




Leslie A. Mooi / 
Attorney for Applicants 
Registration No.: 37,047 
HYSEQ, INC. 
670 Almanor Avenue 



Sunnyvale, CA 94085 
(408) 524-8100 

(408) 524-8145 (Telefacsimile) 
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Docket No.: 789CIP2C 

IN THE UNITED STATES PATENT AND TRADEMARK OFFICE 

Applicant(s) or Patentee(s): Y. Tom Tang, Chenghua Liu, Ping Zhou, Vinod Asundi, 

Feiyan Ren, Qing A. Zhao, Jie Zhang, Jian-Rui Wang, Tom 
Wehrman, Radoje T. Drmanac 

Application No. or Patent No. : Not Yet Assigned 

Filed or Issued: Herewith 

For: NOVEL NUCLEIC ACIDS AND POLYPEPTIDES 

VERIFIED STATEMENT (DECLARATION) CLAIMING SMALL ENTITY STATUS 
(37 CFR § 1.9(f) AND 1.27(c)) - SMALL BUSINESS CONCERN 

I hereby declare that I am 

( ) The owner of the small business concern identified below: 
(X) An official of the small business concern empowered to act on behalf of the 
concern identified below: 

NAME OF CONCERN: HYSEQ, INC. 
ADDRESS: 670 Almanor Avenue 

Sunnyvale, CA 94085 

I hereby declare that the above-identified small business concern qualifies as a small business 
concern as defined in 13 CFR § 121.12, and reproduced in 37 CFR § 1.9(d), for purposes of 
paying reduced fees under § 41(a) and (b) of Title 35, United States Code, in that the number of 
employees of the concern, including those of its affiliates, does not exceed 500 persons. For 
purposes of this statement, (1) the number of employees of the business concern is the average 
over the previous fiscal year of the concern of the persons employed on a full-time, part-time or 
temporary basis during each of the pay periods of the fiscal year, and (2) concerns are affiliates of 
each other when either, directly or indirectly, one concern controls or has the power to control the 
other, or a third party or parties controls or has the power to control both. 

I hereby declare that rights under contract or law have been conveyed to, and remain with, the 
small business concern identified above with regard to the invention, entitled NOVEL NUCLEIC 
ACIDS AND POLYPEPTIDES by inventors, Y. Tom Tang, Chenghua Liu, Ping Zhou, Vinod 
Asundi, Feiyan Ren, Qing A. Zhao, Jie Zhang, Jian-Rui Wang, Tom Wehrman, Radoje T. 
Drmanac, described in 

(X) The specification filed herewith. 

( ) Application Serial No. [ ], filed [Date]. 

( ) Patent No. [ ], issued [Date]. 
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* If the rights held by the above identified small business concern are not exclusive, each 
individual, concern or organization having rights to the invention is listed below and no rights to 
the invention are held by any person, other than the inventor, who could not qualify as a small 
business concern under 37 CFR § 1.9(c), or by any concern which would not qualify as a small 
business concern under 37 CFR § 1.9(d) or a nonprofit organization under 37 CFR § 1.9(e). 

Full Name: 

Address: 

( ) Individual ( ) Small Business Concern ( ) Nonprofit Organization 

I acknowledge the duty to file, in this application or patent, notification of any change in status 
resulting in loss of entitlement to small entity status prior to paying, or at the time of paying, the 
earliest of the issue fee or any maintenance fee due after the date on which status as a small entity 
is no longer appropriate (37 CFR § 1.28(b)). 

I hereby declare that all statements made herein of my own knowledge are true and that all 
statements made on information and belief are believed to be true; and further that these 
statements were made with the knowledge that willful false statements and the like so made are 
punishable by fine or imprisonment, or both, under § 1001 of Title 18 of the United States Code, 
and that such willful false statements may jeopardize the validity of the application, any patent 
issuing thereon, or any patent to which this verified statement is directed. 

Name of person signing: James N. Fletcher 

Title of person 

other than owner: Secretary 

Address of person signing: HYSEQ, INC. 

670 Almanor Avenue 
Sunnyvale, CA 94085 



Signature: 




Date: ll&fto 



^OTE: Separate verified statements are required from each named person, concern or organization having rights 
to the invention averring to their status as small entities. (37 CFR § 1.27) 
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NOVEL NUCLEIC ACIDS AND POLYPEPTIDES 

1. CROSS REFERENCE TO RELATED APPLICATIONS 

5 This application is a continuation-in-part application of U.S. Application Serial 

No. 09/574,454, filed May 19, 2000, Attorney Docket No. 789CIP, which in turn is a 
continuation-in-part application of U.S. Application Serial No. 09/519,705, filed March 
07, 2000, Attorney Docket No. 789, both of which are incorporated herein by reference in 
their entirety. 

10 

2. BACKGROUND OF THE INVENTION 

2.1 TECHNICAL FIELD 

The present invention provides novel polynucleotides and proteins encoded by 
1 5 such polynucleotides, along with uses for these polynucleotides and proteins, for example 
in therapeutic, diagnostic and research methods. 

2.2 BACKGROUND 

Technology aimed at the discovery of protein factors (including e.g., cytokines, 
20 such as lymphokines, interferons, CSFs, chemokines, and interleukins) has matured 
rapidly over the past decade. The now routine hybridization cloning and expression 
cloning techniques clone novel polynucleotides "directly" in the sense that they rely on 
information directly related to the discovered protein (i.e., partial DNA/amino acid 
sequence of the protein in the case of hybridization cloning; activity of the protein in the 
25 case of expression cloning). More recent "indirect" cloning techniques such as signal 
sequence cloning, which isolates DNA sequences based on the presence of a now 
well-recognized secretory leader sequence motif, as well as various PCR-based or low 
stringency hybridization-based cloning techniques, have advanced the state of the art by 
making available large numbers of DNA/amino acid sequences for proteins that are 
30 known to have biological activity, for example, by virtue of their secreted nature in the 
case of leader sequence cloning, by virtue of their cell or tissue source in the case of 
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PCR-based techniques, or by virtue of structural similarity to other genes of known 
biological activity. 

Identified polynucleotide and polypeptide sequences have numerous applications 
in, for example, diagnostics, forensics, gene mapping; identification of mutations 
responsible for genetic disorders or other traits, to assess biodiversity, and to produce 
many other types of data and products dependent on DNA and amino acid sequences. 

3. SUMMARY OF THE INVENTION 

The compositions of the present invention include novel isolated polypeptides, novel 
isolated polynucleotides encoding such polypeptides, including recombinant DNA 
molecules, cloned genes or degenerate variants thereof, especially naturally occurring 
variants such as allelic variants, antisense polynucleotide molecules, and antibodies that 
specifically recognize one or more epitopes present on such polypeptides, as well as 
hybridomas producing such antibodies. 

The compositions of the present invention additionally include vectors, including 
expression vectors, containing the polynucleotides of the invention, cells genetically 
engineered to contain such polynucleotides and cells genetically engineered to express such 
polynucleotides. 

The present invention relates to a collection or library of at least one novel nucleic 
acid sequence assembled from expressed sequence tags (ESTs) isolated mainly by 
sequencing by hybridization (SBH), and in some cases, sequences obtained from one or 
more public databases. The invention relates also to the proteins encoded by such 
polynucleotides, along with therapeutic, diagnostic and research utilities for these 
polynucleotides and proteins. These nucleic acid sequences are designated as SEQ ID NO: 
1 - 35 and are provided in the Sequence Listing. In the nucleic acids provided in the 
Sequence Listing, A is adenosine; C is cytosine; G is guanosine; T is thymine; and N is any 
of the four bases. In the amino acids provided in the Sequence Listing, * corresponds to the 
stop codon. 

The nucleic acid sequences of the present invention also include, nucleic acid 
sequences that hybridize to the complement of SEQ ID NO: 1 - 35 under stringent 
hybridization conditions; nucleic acid sequences which are allelic variants or species 
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homologues of any of the nucleic acid sequences recited above, or nucleic acid sequences 
that encode a peptide comprising a specific domain or truncation of the peptides encoded by 
SEQ ID NO: 1 - 35. A polynucleotide comprising a nucleotide sequence having at least 
90% identity to an identifying sequence of SEQ ID NO: 1 - 35 or a degenerate variant or 
fragment thereof. The identifying sequence can be 100 base pairs in length. 

The nucleic acid sequences of the present invention also include the sequence 
information from the nucleic acid sequences of SEQ ID NO: 1 - 35. The sequence 
information can be a segment of any one of SEQ ID NO: 1 - 35 that uniquely identifies or 
represents the sequence information of SEQ ID NO: 1-35. 

A collection as used in this application can be a collection of only one 
polynucleotide. The collection of sequence information or identifying information of each 
sequence can be provided on a nucleic acid array. In one embodiment, segments of 
sequence information is provided on a nucleic acid array to detect the polynucleotide that 
contains the segment. The array can be designed to detect full-match or mismatch to the 
polynucleotide that contains the segment. The collection can also be provided in a 
computer-readable format. 

This invention also includes the reverse or direct complement of any of the nucleic 
acid sequences recited above; cloning or expression vectors containing the nucleic acid 
sequences; and host cells or organisms transformed with these expression vectors. Nucleic 
acid sequences (or their reverse or direct complements) according to the invention have 
numerous applications in a variety of techniques known to those skilled in the art of 
molecular biology, such as use as hybridization probes, use as primers for PCR, use in an 
array, use in computer-readable media, use in sequencing full-length genes, use for 
chromosome and gene mapping, use in the recombinant production of protein, and use in 
the generation of anti-sense DNA or RNA, their chemical analogs and the like. 

In a preferred embodiment, the nucleic acid sequences of SEQ ID NO: 1-35 or novel 
segments or parts of the nucleic acids of the invention are used as primers in expression 
assays that are well known in the art. In a particularly preferred embodiment, the nucleic 
acid sequences of SEQ ID NO: 1-35 or novel segments or parts of the nucleic acids 
provided herein are used in diagnostics for identifying expressed genes or, as well known in 
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the art and exemplified by Vollrath et al., Science 258:52-59 (1992), as expressed sequence 
tags for physical mapping of the human genome. 

The isolated polynucleotides of the invention include, but are not limited to, a 
polynucleotide comprising any one of the nucleotide sequences set forth in the SEQ ID NO: 
1-35; a polynucleotide comprising any of the full length protein coding sequences of the 
SEQ ID NO: 1-35; and a polynucleotide comprising any of the nucleotide sequences of the 
mature protein coding sequences of the SEQ ID NO: 1-35. The polynucleotides of the 
present invention also include, but are not limited to, a polynucleotide that hybridizes under 
stringent hybridization conditions to (a) the complement of any one of the nucleotide 
sequences set forth in the SEQ ID NO: 1-35; (b) a nucleotide sequence encoding any one of 
the amino acid sequences set forth in the Sequence Listing; (c) a polynucleotide which is an 
allelic variant of any polynucleotides recited above; (d) a polynucleotide which encodes a 
species homolog (e.g. orthologs) of any of the proteins recited above; or (e) a 
polynucleotide that encodes a polypeptide comprising a specific domain or truncation of any 
of the polypeptides comprising an amino acid sequence set forth in the Sequence Listing. 

The isolated polypeptides of the invention include, but are not limited to, a 
polypeptide comprising any of the amino acid sequences set forth in the Sequence Listing; 
or the corresponding full length or mature protein. Polypeptides of the invention also include 
polypeptides with biological activity that are encoded by (a) any of the polynucleotides 
having a nucleotide sequence set forth in the SEQ ID NO: 1-35; or (b) polynucleotides that 
hybridize to the complement of the polynucleotides of (a) under stringent hybridization 
conditions. Biologically or immunologically active variants of any of the polypeptide 
sequences in the Sequence Listing, and "substantial equivalents" thereof (e.g., with at least 
about 65%, 70%, 75%, 80%, 85%, 90%, 95%, 98% or 99% amino acid sequence identity) 
that preferably retain biological activity are also contemplated. The polypeptides of the 
invention may be wholly or partially chemically synthesized but are preferably produced by 
recombinant means using the genetically engineered cells (e.g. host cells) of the invention. 

The invention also provides compositions comprising a polypeptide of the 
invention. Polypeptide compositions of the invention may further comprise an acceptable 
carrier, such as a hydrophilic, e.g., pharmaceutically acceptable, carrier. 



789CIP2C 



4 



The invention also provides host cells transformed or transfected with a 
polynucleotide of the invention. 

The invention also relates to methods for producing a polypeptide of the invention 
comprising growing a culture of the host cells of the invention in a suitable culture 
5 medium under conditions permitting expression of the desired polypeptide, and purifying 
the polypeptide from the culture or from the host cells. Preferred embodiments include 
those in which the protein produced by such process is a mature form of the protein. 

Polynucleotides according to the invention have numerous applications in a 
variety of techniques known to those skilled in the art of molecular biology. These 
10 techniques include use as hybridization probes, use as oligomers, or primers, for PCR, 
use for chromosome and gene mapping, use in the recombinant production of protein, 
and use in generation of anti-sense DNA or RNA, their chemical analogs and the like. 
For example, when the expression of an mRNA is largely restricted to a particular cell or 
tissue type, polynucleotides of the invention can be used as hybridization probes to detect 
15 the presence of the particular cell or tissue mRNA in a sample using, e.g., in situ 
hybridization. 

In other exemplary embodiments, the polynucleotides are used in diagnostics as 
expressed sequence tags for identifying expressed genes or, as well known in the art and 
exemplified by Vollrath et aL, Science 258:52-59 (1992), as expressed sequence tags for 
20 physical mapping of the human genome. 

The polypeptides according to the invention can be used in a variety of 
conventional procedures and methods that are currently applied to other proteins. For 
example, a polypeptide of the invention can be used to generate an antibody that 
specifically binds the polypeptide. Such antibodies, particularly monoclonal antibodies, 
25 are useful for detecting or quantitating the polypeptide in tissue. The polypeptides of the 
invention can also be used as molecular weight markers, and as a food supplement. 

Methods are also provided for preventing, treating, or ameliorating a medical 
condition which comprises the step of administering to a mammalian subject a 
therapeutically effective amount of a composition comprising a polypeptide of the 
30 present invention and a pharmaceutically acceptable carrier. 
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In particular, the polypeptides and polynucleotides of the invention can be 
utilized, for example, in methods for the prevention and/or treatment of disorders 
involving aberrant protein expression or biological activity. 

The present invention further relates to methods for detecting the presence of the 

5 polynucleotides or polypeptides of the invention in a sample. Such methods can, for 
example, be utilized as part of prognostic and diagnostic evaluation of disorders as 
recited herein and for the identification of subjects exhibiting a predisposition to such 
conditions. The invention provides a method for detecting the polynucleotides of the 
invention in a sample, comprising contacting the sample with a compound that binds to 

1 0 and forms a complex with the polynucleotide of interest for a period sufficient to form 
the complex and under conditions sufficient to form a complex and detecting the complex 
such that if a complex is detected, the polynucleotide of interest is detected. The 
invention also provides a method for detecting the polypeptides of the invention in a 
sample comprising contacting the sample with a compound that binds to and forms a 

1 5 complex with the polypeptide under conditions and for a period sufficient to form the 

complex and detecting the formation of the complex such that if a complex is formed, the 
polypeptide is detected. 

The invention also provides kits comprising polynucleotide probes and/or 
monoclonal antibodies, and optionally quantitative standards, for carrying out methods of 

20 the invention. Furthermore, the invention provides methods for evaluating the efficacy of 
drugs, and monitoring the progress of patients, involved in clinical trials for the treatment 
of disorders as recited above. 

The invention also provides methods for the identification of compounds that 
modulate (i.e., increase or decrease) the expression or activity of the polynucleotides 

25 and/or polypeptides of the invention. Such methods can be utilized, for example, for the 
identification of compounds that can ameliorate symptoms of disorders as recited herein. 
Such methods can include, but are not limited to, assays for identifying compounds and 
other substances that interact with (e.g., bind to) the polypeptides of the invention. The 
invention provides a method for identifying a compound that binds to the polypeptides of 

3 0 the invention comprising contacting the compound with a polypeptide of the invention in 
a cell for a time sufficient to form a polypeptide/compound complex, wherein the 
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complex drives expression of a reporter gene sequence in the cell; and detecting the 
complex by detecting the reporter gene sequence expression such that if expression of the 
reporter gene is detected the compound the binds to a polypeptide of the invention is 
identified. 

The methods of the invention also provides methods for treatment which involve 
the administration of the polynucleotides or polypeptides of the invention to individuals 
exhibiting symptoms or tendencies. In addition, the invention encompasses methods for 
treating diseases or disorders as recited herein comprising administering compounds and 
other substances that modulate the overall activity of the target gene products. 
Compounds and other substances can effect such modulation either on the level of target 
gene/protein expression or target protein activity. 

The polypeptides of the present invention and the polynucleotides encoding them 
are also useful for the same functions known to one of skill in the art as the polypeptides 
and polynucleotides to which they have homology (set forth in Table 1); for which they 
have a signature region (as set forth in Table 3); or for which they have homology to a 
gene family (as set forth in Table 4). If no homology is set forth for a sequence, then the 
polypeptides and polynucleotides of the present invention are useful for a variety of 
applications, as described herein, including use in arrays for detection. 

4. DETAILED DESCRIPTION OF THE INVENTION 

4.1 DEFINITIONS 

It must be noted that as used herein and in the appended claims, the singular 
forms "a", "an" and "the" include plural references unless the context clearly dictates 
otherwise. 

The term "active" refers to those forms of the polypeptide which retain the 
biologic and/or immunologic activities of any naturally occurring polypeptide. According 
to the invention, the terms "biologically active" or "biological activity" refer to a protein 
or peptide having structural, regulatory or biochemical functions of a naturally occurring 
molecule. Likewise "immunologically active" or "immunological activity" refers to the 



789CTP2C 



7 



capability of the natural, recombinant or synthetic polypeptide to induce a specific 
immune response in appropriate animals or cells and to bind with specific antibodies. 

The term "activated cells" as used in this application are those cells which are 
engaged in extracellular or intracellular membrane trafficking, including the export of 
secretory or enzymatic molecules as part of a normal or disease process. 

The terms "complementary" or "complementarity" refer to the natural binding of 
polynucleotides by base pairing. For example, the sequence 5'-AGT-3' binds to the 
complementary sequence 3'-TCA-5\ Complementarity between two single-stranded 
molecules may be "partial" such that only some of the nucleic acids bind or it may be 
"complete" such that total complementarity exists between the single stranded molecules. 
The degree of complementarity between the nucleic acid strands has significant effects on 
the efficiency and strength of the hybridization between the nucleic acid strands. 

The term "embryonic stem cells (ES)" refers to a cell that can give rise to many 
differentiated cell types in an embryo or an adult, including the germ cells. The term 
"germ line stem cells (GSCs)" refers to stem cells derived from primordial stem cells that 
provide a steady and continuous source of germ cells for the production of gametes. The 
term "primordial germ cells (PGCs)" refers to a small population of cells set aside from 
other cell lineages particularly from the yolk sac, mesenteries, or gonadal ridges during 
embryogenesis that have the potential to differentiate into germ cells and other cells. 
PGCs are the source from which GSCs and ES cells are derived The PGCs, the GSCs 
and the ES cells are capable of self-renewal. Thus these cells not only populate the germ 
line and give rise to a plurality of terminally differentiated cells that comprise the adult 
specialized organs, but are able to regenerate themselves. 

The term "expression modulating fragment," EMF, means a series of nucleotides 
which modulates the expression of an operably linked ORF or another EMF. 

As used herein, a sequence is said to "modulate the expression of an operably 
linked sequence" when the expression of the sequence is altered by the presence of the 
EMF. EMFs include, but are not limited to, promoters, and promoter modulating 
sequences (inducible elements). One class of EMFs are nucleic acid fragments which 
induce the expression of an operably linked ORF in response to a specific regulatory 
factor or physiological event. 
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The terms "nucleotide sequence" or "nucleic acid" or "polynucleotide" or 
"oligonculeotide" are used interchangeably and refer to a heteropolymer of nucleotides or 
the sequence of these nucleotides. These phrases also refer to DNA or RNA of genomic 
or synthetic origin which may be single-stranded or double-stranded and may represent 
the sense or the antisense strand, to peptide nucleic acid (PNA) or to any DNA-like or 
RNA-like material. It is contemplated that where the polynucleotide is RNA, the T 
(thymine) in the sequences provided herein is substituted with U (uracil). Generally, 
nucleic acid segments provided by this invention may be assembled from fragments of 
the genome and short oligonucleotide linkers, or from a series of oligonucleotides, or 
from individual nucleotides, to provide a synthetic nucleic acid which is capable of being 
expressed in a recombinant transcriptional unit comprising regulatory elements derived 
from a microbial or viral operon, or a eukaryotic gene. 

The terms "oligonucleotide fragment" or a "polynucleotide fragment", "portion," 
or "segment" or "probe" or "primer" are used interchangeable and refer to a sequence of 
nucleotide residues which are at least about 5 nucleotides, more preferably at least about 
7 nucleotides, more preferably at least about 9 nucleotides, more preferably at least about 
1 1 nucleotides and most preferably at least about 17 nucleotides. The fragment is 
preferably less than about 500 nucleotides, preferably less than about 200 nucleotides, 
more preferably less than about 100 nucleotides, more preferably less than about 50 
nucleotides and most preferably less than 30 nucleotides. Preferably the probe is from 
about 6 nucleotides to about 200 nucleotides, preferably from about 15 to about 50 
nucleotides, more preferably from about 17 to 30 nucleotides and most preferably from 
about 20 to 25 nucleotides. Preferably the fragments can be used in polymerase chain 
reaction (PCR), various hybridization procedures or microarray procedures to identify or 
amplify identical or related parts of mRNA or DNA molecules. A fragment or segment 
may uniquely identify each polynucleotide sequence of the present invention. Preferably 
the fragment comprises a sequence substantially similar to any one of SEQ ID NOs:l-35. 

Probes may, for example, be used to determine whether specific mRNA 
molecules are present in a cell or tissue or to isolate similar nucleic acid sequences from 
chromosomal DNA as described by Walsh et al. (Walsh, P.S. et al., 1992, PCR Methods 
Appl 1 :241-250). They may be labeled by nick translation, Klenow fill-in reaction, PCR, 



789CIP2C 



9 



or other methods well known in the art. Probes of the present invention, their preparation 
and/or labeling are elaborated in Sambrook, J. et al., 1989, Molecular Cloning: A 
Laboratory Manual, Cold Spring Harbor Laboratory, NY; or Ausubel, F.M. et al., 1989, 
Current Protocols in Molecular Biology, John Wiley & Sons, New York NY, both of 
which are incorporated herein by reference in their entirety. 

The nucleic acid sequences of the present invention also include the sequence 
information from the nucleic acid sequences of SEQ ID NOs: 1-35. The sequence 
information can be a segment of any one of SEQ ID NOs: 1-35 that uniquely identifies or 
represents the sequence information of that sequence of SEQ ID NO: 1-35. One such 
segment can be a twenty-mer nucleic acid sequence because the probability that a twenty- 
mer is fully matched in the human genome is 1 in 300. In the human genome, there are 
three billion base pairs in one set of chromosomes. Because 4 20 possible twenty-mers 
exist, there are 300 times more twenty-mers than there are base pairs in a set of human 
chromosome. Using the same analysis, the probability for a seventeen-mer to be fully 
matched in the human genome is approximately 1 in 5. When these segments are used in 
arrays for expression studies, fifteen-mer segments can be used. The probability that the 
fifteen-mer is fully matched in the expressed sequences is also approximately one in five 
because expressed sequences comprise less than approximately 5% of the entire genome 
sequence. 

Similarly, when using sequence information for detecting a single mismatch, a 
segment can be a twenty-five mer. The probability that the twenty-five mer would appear in 
a human genome with a single mismatch is calculated by multiplying the probability for a 
Ml match (l-f-4 25 ) times the increased probability for mismatch at each nucleotide position 
(3 x 25). The probability that an eighteen mer with a single mismatch can be detected in an 
array for expression studies is approximately one in five. The probability that a twenty-mer 
with a single mismatch can be detected in a human genome is approximately one in five. 

The term "open reading frame," ORF, means a series of nucleotide triplets coding 
for amino acids without any termination codons and is a sequence translatable into 
protein. 

The terms "operably linked" or "operably associated" refer to functionally related 
nucleic acid sequences. For example, a promoter is operably associated or operably 
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linked with a coding sequence if the promoter controls the transcription of the coding 
sequence. While operably linked nucleic acid sequences can be contiguous and in the 
same reading frame, certain genetic elements e.g. repressor genes are not contiguously 
linked to the coding sequence but still control transcription/translation of the coding 
5 sequence. 

The term "pluripotent" refers to the capability of a cell to differentiate into a 
number of differentiated cell types that are present in an adult organism. A pluripotent 
cell is restricted in its differentiation capability in comparison to a totipotent cell. 
The terms "polypeptide" or "peptide" or "amino acid sequence" refer to an 

1 0 oligopeptide, peptide, polypeptide or protein sequence or fragment thereof and to 
naturally occurring or synthetic molecules. A polypeptide "fragment," "portion," or 
"segment" is a stretch of amino acid residues of at least about 5 amino acids, preferably at 
least about 7 amino acids, more preferably at least about 9 amino acids and most 
preferably at least about 17 or more amino acids. The peptide preferably is not greater 

1 5 than about 200 amino acids, more preferably less than 150 amino acids and most 

preferably less than 100 amino acids. Preferably the peptide is from about 5 to about 200 
amino acids. To be active, any polypeptide must have sufficient length to display 
biological and/or immunological activity. 

The term "naturally occurring polypeptide" refers to polypeptides produced by 

20 cells that have not been genetically engineered and specifically contemplates various 
polypeptides arising from post-translational modifications of the polypeptide including, 
but not limited to, acetylation, carboxylation, glycosylation, phosphorylation, lipidation 
and acylation. 

The term "translated protein coding portion" means a sequence which encodes for 
25 the full length protein which may include any leader sequence or any processing 
sequence. 

The term "mature protein coding sequence" means a sequence which encodes a 
peptide or protein without a signal or leader sequence. The peptide may have been 
produced by processing in the cell which removes any leader/signal sequence. The 
30 peptide may be produced synthetically or the protein may have been produced using a 
polynucleotide only encoding for the mature protein coding sequence. 
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The term "derivative" refers to polypeptides chemically modified by such 
techniques as ubiquitination, labeling (e.g., with radionuclides or various enzymes), 
covalent polymer attachment such as pegylation (derivatization with polyethylene glycol) 
and insertion or substitution by chemical synthesis of amino acids such as ornithine, 

5 which do not normally occur in human proteins. 

The term "variant"(or "analog") refers to any polypeptide differing from naturally 
occurring polypeptides by amino acid insertions, deletions, and substitutions, created 
using, e g., recombinant DNA techniques. Guidance in determining which amino acid 
residues maybe replaced, added or deleted without abolishing activities of interest, may 

1 0 be found by comparing the sequence of the particular polypeptide with that of 

homologous peptides and minimizing the number of amino acid sequence changes made 
in regions of high homology (conserved regions) or by replacing amino acids with 
consensus sequence. 

Alternatively, recombinant variants encoding these same or similar polypeptides 

1 5 may be synthesized or selected by making use of the "redundancy" in the genetic code. 
Various codon substitutions, such as the silent changes which produce various restriction 
sites, may be introduced to optimize cloning into a plasmid or viral vector or expression 
in a particular prokaryotic or eukaryotic system. Mutations in the polynucleotide 
sequence may be reflected in the polypeptide or domains of other peptides added to the 

20 polypeptide to modify the properties of any part of the polypeptide, to change 
characteristics such as ligand-binding affinities, interchain affinities, or 
degradation/turnover rate. 

Preferably, amino acid "substitutions" are the result of replacing one amino acid 
with another amino acid having similar structural and/or chemical properties, i.e., 

25 conservative amino acid replacements. "Conservative" amino acid substitutions may be 
made on the basis of similarity in polarity, charge, solubility, hydrophobicity, 
hydrophilicity, and/or the amphipathic nature of the residues involved. For example, 
nonpolar (hydrophobic) amino acids include alanine, leucine, isoleucine, valine, proline, 
phenylalanine, tryptophan, and methionine; polar neutral amino acids include glycine, 

30 serine, threonine, cysteine, tyrosine, asparagine, and glutamine; positively charged (basic) 
amino acids include arginine, lysine, and histidine; and negatively charged (acidic) amino 
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acids include aspartic acid and glutamic acid. "Insertions" or "deletions" are preferably 
in the range of about 1 to 20 amino acids, more preferably 1 to 10 amino acids. The 
variation allowed may be experimentally determined by systematically making insertions, 
deletions, or substitutions of amino acids in a polypeptide molecule using recombinant 

5 DNA techniques and assaying the resulting recombinant variants for activity. 

Alternatively, where alteration of function is desired, insertions, deletions or 
non-conservative alterations can be engineered to produce altered polypeptides. Such 
alterations can, for example, alter one or more of the biological functions or biochemical 
characteristics of the polypeptides of the invention. For example, such alterations may 

1 0 change polypeptide characteristics such as ligand-binding affinities, interchain affinities, 
or degradation/turnover rate. Further, such alterations can be selected so as to generate 
polypeptides that are better suited for expression, scale up and the like in the host cells 
chosen for expression. For example, cysteine residues can be deleted or substituted with 
another amino acid residue in order to eUminate disulfide bridges. 

1 5 The terms "purified" or "substantially purified" as used herein denotes that the 

indicated nucleic acid or polypeptide is present in the substantial absence of other 
biological macromolecules, e.g., polynucleotides, proteins, and the like. In one 
embodiment, the polynucleotide or polypeptide is purified such that it constitutes at least 
95% by weight, more preferably at least 99% by weight, of the indicated biological 

20 macromolecules present (but water, buffers, and other small molecules, especially 
molecules having a molecular weight of less than 1000 daltons, can be present). 

The term "isolated" as used herein refers to a nucleic acid or polypeptide 
separated from at least one other component (e.g., nucleic acid or polypeptide) present 
with the nucleic acid or polypeptide in its natural source. In one embodiment, the nucleic 

25 acid or polypeptide is found in the presence of (if anything) only a solvent, buffer, ion, or 
other component normally present in a solution of the same. The terms "isolated" and 
"purified" do not encompass nucleic acids or polypeptides present in their natural source. 

The term "recombinant," when used herein to refer to a polypeptide or protein, 
means that a polypeptide or protein is derived from recombinant (e.g., microbial, insect, 

30 or mammalian) expression systems. "Microbial" refers to recombinant polypeptides or 
proteins made in bacterial or fungal (e.g., yeast) expression systems. As a product, 
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"recombinant microbial" defines a polypeptide or protein essentially free of native 
endogenous substances and unaccompanied by associated native glycosylation. 
Polypeptides or proteins expressed in most bacterial cultures, e.g., E. coli, will be free of 
glycosylation modifications; polypeptides or proteins expressed in yeast will have a 

5 glycosylation pattern in general different from those expressed in mammalian cells. 

The term "recombinant expression vehicle or vector" refers to a plasmid or phage 
or virus or vector, for expressing a polypeptide from a DNA (RNA) sequence. An 
expression vehicle can comprise a transcriptional unit comprising an assembly of (1) a 
genetic element or elements having a regulatory role in gene expression, for example, 

10 promoters or enhancers, (2) a structural or coding sequence which is transcribed into 
mRNA and translated into protein, and (3) appropriate transcription initiation and 
termination sequences. Structural units intended for use in yeast or eukaryotic expression 
systems preferably include a leader sequence enabling extracellular secretion of 
translated protein by a host cell. Alternatively, where recombinant protein is expressed 

1 5 without a leader or transport sequence, it may include an amino terminal methionine 
residue. This residue may or may not be subsequently cleaved from the expressed 
recombinant protein to provide a final product. 

The term "recombinant expression system" means host cells which have stably 
integrated a recombinant transcriptional unit into chromosomal DNA or carry the 

20 recombinant transcriptional unit extrachromosomally. Recombinant expression systems 
as defined herein will express heterologous polypeptides or proteins upon induction of 
the regulatory elements linked to the DNA segment or synthetic gene to be expressed. 
This term also means host cells which have stably integrated a recombinant genetic 
element or elements having a regulatory role in gene expression, for example, promoters 

25 or enhancers. Recombinant expression systems as defined herein will express 

polypeptides or proteins endogenous to the cell upon induction of the regulatory elements 
linked to the endogenous DNA segment or gene to be expressed. The cells can be 
prokaryotic or eukaryotic. 

The term "secreted" includes a protein that is transported across or through a 

30 membrane, including transport as a result of signal sequences in its amino acid sequence 
when it is expressed in a suitable host cell. "Secreted" proteins include without limitation 

789CIP2C 14 



4 



proteins secreted wholly (e.g., soluble proteins) or partially (e.g., receptors) from the cell 
in which they are expressed. "Secreted" proteins also include without limitation proteins 
that are transported across the membrane of the endoplasmic reticulum. "Secreted" 
proteins are also intended to include proteins containing non-typical signal sequences 
5 (e.g. Merleukin-1 Beta, see Krasney, P.A. and Young, P.R. (1992) Cytokine 4(2): 134 
-143) and factors released from damaged cells (e.g. Interleukin-1 Receptor Antagonist, 
see Arend, W.P. et al. (1998) Annu. Rev. Immunol. 16:27-55) 

Where desired, an expression vector may be designed to contain a "signal or 
leader sequence" which will direct the polypeptide through the membrane of a cell. Such 

10 a sequence may be naturally present on the polypeptides of the present invention or 
provided from heterologous protein sources by recombinant DNA techniques. 

The term "stringent" is used to refer to conditions that are commonly understood 
in the art as stringent. Stringent conditions can include highly stringent conditions (i.e., 
hybridization to filter-bound DNA in 0.5 M NaHP0 4 , 7% sodium dodecyl sulfate (SDS), 

15 1 mM EDTA at 65°C, and washing in 0.1X SSC/0.1% SDS at 68°C), and moderately 
stringent conditions (i.e., washing in 0.2X SSC/0.1% SDS at 42°C). Other exemplary 
hybridization conditions are described herein in the examples. 

In instances of hybridization of deoxyoligonucleotides, additional exemplary 
stringent hybridization conditions include washing in 6X SSC/0.05% sodium 

20 pyrophosphate at 37°C (for 14-base oligonucleotides), 48°C (for 17-base oligos), 55°C 
(for 20-base oligonucleotides), and 60°C (for 23-base oligonucleotides). 

As used herein, "substantially equivalent" can refer both to nucleotide and amino 
acid sequences, for example a mutant sequence, that varies from a reference sequence by 
one or more substitutions, deletions, or additions, the net effect of which does not result 

25 in an adverse functional dissimilarity between the reference and subject sequences. 

Typically, such a substantially equivalent sequence varies from one of those listed herein 
by no more than about 35% (i.e., the number of individual residue substitutions, 
additions, and/or deletions in a substantially equivalent sequence, as compared to the 
corresponding reference sequence, divided by the total number of residues in the 

30 substantially equivalent sequence is about 0.35 or less). Such a sequence is said to have 
65% sequence identity to the listed sequence. In one embodiment, a substantially 
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equivalent, e.g., mutant, sequence of the invention varies from a listed sequence by no 
more than 30% (70% sequence identity); in a variation of this embodiment, by no more 
than 25% (75% sequence identity); and in a further variation of this embodiment, by no 
more than 20% (80% sequence identity) and in a further variation of this embodiment, by 
5 no more than 10% (90% sequence identity) and in a further variation of this embodiment, 
by no more that 5% (95% sequence identity). Substantially equivalent, e.g., mutant, 
amino acid sequences according to the invention preferably have at least 80% sequence 
identity with a listed amino acid sequence, more preferably at least 90% sequence 
identity. Substantially equivalent nucleotide sequences of the invention can have lower 

10 percent sequence identities, taking into account, for example, the redundancy or 

degeneracy of the genetic code. Preferably, nucleotide sequence has at least about 65% 
identity, more preferably at least about 75% identity, and most preferably at least about 
95% identity. For the purposes of the present invention, sequences having substantially 
equivalent biological activity and substantially equivalent expression characteristics are 

1 5 considered substantially equivalent. For the purposes of determining equivalence, 
truncation of the mature sequence (e.g., via a mutation which creates a spurious stop 
codon) should be disregarded. Sequence identity may be determined, e.g., using the 
Jotun Hein method (Hein, J. (1990) Methods EnzymoL 183:626-645). Identity between 
sequences can also be determined by other methods known in the art, e.g. by varying 

20 hybridization conditions. 

The term "totipotent" refers to the capability of a cell to differentiate into all of 
the cell types of an adult organism. 

The term "transformation" means introducing DNA into a suitable host cell so 
that the DNA is replicable, either as an extrachromosomal element, or by chromosomal 

25 integration. The term "transfection" refers to the taking up of an expression vector by a 
suitable host cell, whether or not any coding sequences are in fact expressed. The term 
"infection" refers to the introduction of nucleic acids into a suitable host cell by use of a 
virus or viral vector. 

As used herein, an "uptake modulating fragment," UMF, means a series of 

30 nucleotides which mediate the uptake of a linked DNA fragment into a cell. UMFs can 
be readily identified using known UMFs as a target sequence or target motif with the 
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computer-based systems described below. The presence and activity of a UMF can be 
confirmed by attaching the suspected UMF to a marker sequence. The resulting nucleic 
acid molecule is then incubated with an appropriate host under appropriate conditions and 
the uptake of the marker sequence is determined. As described above, a UMF will 
5 increase the frequency of uptake of a linked marker sequence. 

Each of the above terms is meant to encompass all that is described for each, 
unless the context dictates otherwise. 

4.2 NUCLEIC ACIDS OF THE INVENTION 

10 Nucleotide sequences of the invention are set forth in the Sequence Listing. 

The isolated polynucleotides of the invention include a polynucleotide comprising 
the nucleotide sequences of the SEQ ID NO: 1 - 35; a polynucleotide encoding any one 
of the peptide sequences of SEQ ID NO: 1 - 35; and a polynucleotide comprising the 
nucleotide sequence encoding the mature protein coding sequence of the polynucleotides 

1 5 of any one of SEQ ID NO: 1-35. The polynucleotides of the present invention also 

include, but are not limited to, a polynucleotide that hybridizes under stringent conditions 
to (a) the complement of any of the nucleotides sequences of the SEQ ID NO: 1 - 35; (b) 
nucleotide sequences encoding any one of the amino acid sequences set forth in the 
Sequence Listing; (c) a polynucleotide which is an allelic variant of any polynucleotide 

20 recited above; (d) a polynucleotide which encodes a species homolog of any of the 

proteins recited above; or (e) a polynucleotide that encodes a polypeptide comprising a 
specific domain or truncation of the polypeptides of SEQ ID NO: 1-35. Domains of 
interest may depend on the nature of the encoded polypeptide; e.g., domains in receptor- 
like polypeptides include ligand-binding, extracellular, transmembrane, or cytoplasmic 

25 domains, or combinations thereof; domains in immunoglobulin-like proteins include the 
variable immunoglobulin-like domains; domains in enzyme-like polypeptides include 
catalytic and substrate binding domains; and domains in ligand polypeptides include 
receptor-binding domains. 

The polynucleotides of the invention include naturally occurring or wholly or 

30 partially synthetic DNA, e.g., cDNA and genomic DNA, and RNA, e.g., mRNA. The 
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polynucleotides may include all of the coding region of the cDNA or may represent a 
portion of the coding region of the cDNA. 

The present invention also provides genes corresponding to the cDNA sequences 
disclosed herein. The corresponding genes can be isolated in accordance with known 
5 methods using the sequence information disclosed herein. Such methods include the 

preparation of probes or primers from the disclosed sequence information for identification 
and/or amplification of genes in appropriate genomic libraries or other sources of genomic 
materials. Further 5' and 3' sequence can be obtained using methods known in the art. For 
example, full length cDNA or genomic DNA that corresponds to any of the polynucleotides 

1 0 of the SEQ ID NO: 1 - 35 can be obtained by screening appropriate cDNA or genomic 
DNA libraries under suitable hybridization conditions using any of the polynucleotides of 
the SEQ ID NO: 1 - 35 or a portion thereof as a probe. Alternatively, the polynucleotides 
of the SEQ ID NO: 1 - 35 may be used as the basis for suitable primer(s) that allow 
identification and/or amplification of genes in appropriate genomic DNA or cDNA libraries. 

1 5 The nucleic acid sequences of the invention can be assembled from ESTs and 

sequences (including cDNA and genomic sequences) obtained from one or more public 
databases, such as dbEST, gbpri, and UniGene. The EST sequences can provide identifying 
sequence information, representative fragment or segment information, or novel segment 
information for the full-length gene. 

20 The polynucleotides of the invention also provide polynucleotides including 

nucleotide sequences that are substantially equivalent to the polynucleotides recited 
above. Polynucleotides according to the invention can have, e.g., at least about 65%, at 
least about 70%, at least about 75%, at least about 80%, more typically at least about 
90%, and even more typically at least about 95%, sequence identity to a polynucleotide 

25 recited above. 

Included within the scope of the nucleic acid sequences of the invention are 
nucleic acid sequence fragments that hybridize under stringent conditions to any of the 
nucleotide sequences of the SEQ ID NO: 1 - 35, or complements thereof, which fragment 
is greater than about 5 nucleotides, preferably 7 nucleotides, more preferably greater than 

30 9 nucleotides and most preferably greater than 17 nucleotides. Fragments of, e.g. 15, 17, 
or 20 nucleotides or more that are selective for (i.e. specifically hybridize to any one of 
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the polynucleotides of the invention) are contemplated. Probes capable of specifically 
hybridizing to a polynucleotide can differentiate polynucleotide sequences of the 
invention from other polynucleotide sequences in the same family of genes or can 
differentiate human genes from genes of other species, and are preferably based on 
5 unique nucleotide sequences. 

The sequences falling within the scope of the present invention are not limited to 
these specific sequences, but also include allelic and species variations thereof Allelic and 
species variations can be routinely determined by comparing the sequence provided in SEQ 
ID NO: 1 - 35, a representative fragment thereof, or a nucleotide sequence at least 90% 

10 identical, preferably 95% identical, to SEQ ID NOs: 1 - 35 with a sequence from another 
isolate of the same species. Furthermore, to accommodate codon variability, the invention 
includes nucleic acid molecules coding for the same amino acid sequences as do the specific 
ORFs disclosed herein. In other words, in the coding region of an ORF, substitution of one 
codon for another codon that encodes the same amino acid is expressly contemplated. 

1 5 The nearest neighbor or homology result for the nucleic acids of the present 

invention, including SEQ ID NOs: 1 - 35, can be obtained by searching a database using an 
algorithm or a program. Preferably, a BLAST which stands for Basic Local Alignment 
Search Tool is used to search for local sequence alignments (Altshul, S.F. J Mol. Evol. 36 
290-300 (1993) and Altschul S.F. et al J. Mol. Biol. 21:403-410 (1990)). Alternatively a 

20 FASTA version 3 search against Genpept, using Fastxy algorithm. 

Species homologs (or orthologs) of the disclosed polynucleotides and proteins are 
also provided by the present invention. Species homologs may be isolated and identified 
by making suitable probes or primers from the sequences provided herein and screening a 
suitable nucleic acid source from the desired species. 

25 The invention also encompasses allelic variants of the disclosed polynucleotides 

or proteins; that is, naturally-occurring alternative forms of the isolated polynucleotide 
which also encode proteins which are identical, homologous or related to that encoded by 
the polynucleotides. 

The nucleic acid sequences of the invention are further directed to sequences 

30 which encode variants of the described nucleic acids. These amino acid sequence 
variants may be prepared by methods known in the art by introducing appropriate 
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nucleotide changes into a native or variant polynucleotide. There are two variables in the 
construction of amino acid sequence variants: the location of the mutation and the nature 
of the mutation. Nucleic acids encoding the amino acid sequence variants are preferably 
constructed by mutating the polynucleotide to encode an amino acid sequence that does 
5 not occur in nature. These nucleic acid alterations can be made at sites that differ in the 
nucleic acids from different species (variable positions) or in highly conserved regions 
(constant regions). Sites at such locations will typically be modified in series, e.g., by 
substituting first with conservative choices {e.g., hydrophobic amino acid to a different 
hydrophobic amino acid) and then with more distant choices {e.g., hydrophobic amino 

10 acid to a charged amino acid), and then deletions or insertions may be made at the target 
site. Amino acid sequence deletions generally range from about 1 to 30 residues, 
preferably about 1 to 10 residues, and are typically contiguous. Amino acid insertions 
include amino- and/or carboxyl-terminal fusions ranging in length from one to one 
hundred or more residues, as well as intrasequence insertions of single or multiple amino 

15 acid residues. Intrasequence insertions may range generally from about 1 to 10 amino 
residues, preferably from 1 to 5 residues. Examples of terminal insertions include the 
heterologous signal sequences necessary for secretion or for intracellular targeting in 
different host cells and sequences such as FLAG or poly-histidine sequences useful for 
purifying the expressed protein. 

20 In a preferred method, polynucleotides encoding the novel amino acid sequences 

are changed via site-directed mutagenesis. This method uses oligonucleotide sequences 
to alter a polynucleotide to encode the desired amino acid variant, as well as sufficient 
adjacent nucleotides on both sides of the changed amino acid to form a stable duplex on 
either side of the site of being changed. In general, the techniques of site-directed 

25 mutagenesis are well known to those of skill in the art and this technique is exemplified 
by publications such as, Edelman et al, DNA 2:183 (1983). A versatile and efficient 
method for producing site-specific changes in a polynucleotide sequence was published 
by Zoller and Smith, Nucleic Acids Res. 10:6487-6500 (1982). PCR may also be used to 
create amino acid sequence variants of the novel nucleic acids. When small amounts of 

30 template DNA are used as starting material, primer(s) that differs slightly in sequence 
from the corresponding region in the template DNA can generate the desired amino acid 
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variant. PCR amplification results in a population of product DNA fragments that differ 
from the polynucleotide template encoding the polypeptide at the position specified by 
the primer. The product DNA fragments replace the corresponding region in the plasmid 
and this gives a polynucleotide encoding the desired amino acid variant. 
5 A further technique for generating amino acid variants is the cassette mutagenesis 

technique described in Wells et al., Gene 34:315 (1985); and other mutagenesis 
techniques well known in the art, such as, for example, the techniques in Sambrook et al, 
supra, and Current Protocols in Molecular Biology, Ausubel et al. Due to the inherent 
degeneracy of the genetic code, other DNA sequences which encode substantially the 

10 same or a functionally equivalent amino acid sequence may be used in the practice of the 
invention for the cloning and expression of these novel nucleic acids. Such DNA 
sequences include those which are capable of hybridizing to the appropriate novel nucleic 
acid sequence under stringent conditions. 

Polynucleotides encoding preferred polypeptide truncations of the invention can 

15 be used to generate polynucleotides encoding chimeric or fusion proteins comprising one 
or more domains of the invention and heterologous protein sequences. 

The polynucleotides of the invention additionally include the complement of any 
of the polynucleotides recited above. The polynucleotide can be DNA (genomic, cDNA, 
amplified, or synthetic) or RNA, Methods and algorithms for obtaining such 

20 polynucleotides are well known to those of skill in the art and can include, for example, 
methods for determining hybridization conditions that can routinely isolate 
polynucleotides of the desired sequence identities. 

In accordance with the invention, polynucleotide sequences comprising the 
mature protein coding sequences corresponding to any one of SEQ ID NO: 1-35, or 

25 functional equivalents thereof, may be used to generate recombinant DNA molecules that 
direct the expression of that nucleic acid, or a functional equivalent thereof, in 
appropriate host cells. Also included are the cDNA inserts of any of the clones identified 
herein. 

A polynucleotide according to the invention can be joined to any of a variety of 
30 other nucleotide sequences by well-established recombinant DNA techniques (see 

Sambrook J et al. (1989) Molecular Cloning: A Laboratory Manual, Cold Spring Harbor 



789CIP2C 



21 



Laboratory, NY). Useful nucleotide sequences for joining to polynucleotides include an 
assortment of vectors, e.g., plasmids, cosmids, lambda phage derivatives, phagemids, and 
the like, that are well known in the art. Accordingly, the invention also provides a vector 
including a polynucleotide of the invention and a host cell containing the polynucleotide. 
5 In general, the vector contains an origin of replication functional in at least one organism, 
convenient restriction endonuclease sites, and a selectable marker for the host cell. 
Vectors according to the invention include expression vectors, replication vectors, probe 
generation vectors, and sequencing vectors. A host cell according to the invention can be 
a prokaryotic or eukaryotic cell and can be a unicellular organism or part of a 

1 0 multicellular organism. 

The present invention further provides recombinant constructs comprising a 
nucleic acid having any of the nucleotide sequences of the SEQ ED NOs: 1 - 35 or a 
fragment thereof or any other polynucleotides of the invention. In one embodiment, the 
recombinant constructs of the present invention comprise a vector, such as a plasmid or 

15 viral vector, into which a nucleic acid having any of the nucleotide sequences of the SEQ 
ID NOs: 1 - 35 or a fragment thereof is inserted, in a forward or reverse orientation. In 
the case of a vector comprising one of the ORFs of the present invention, the vector may 
further comprise regulatory sequences, including for example, a promoter, operably 
linked to the ORF. Large numbers of suitable vectors and promoters are known to those 

20 of skill in the art and are commercially available for generating the recombinant 
constructs of the present invention. The following vectors are provided by way of 
example. Bacterial: pBs, phagescript, PsiX174, pBluescript SK, pBs KS, pNH8a, 
pNH16a, pNH18a, pNH46a (Stratagene); pTrc99A, pKK223-3, pKK233-3, pDR540, 
pRIT5 (Pharmacia). Eukaryotic: pWLneo, pSV2cat, pOG44, PXTI, pSG (Stratagene) 

25 pSVK3, pBPV, pMSG, pSVL (Pharmacia). 

The isolated polynucleotide of the invention may be operably linked to an 
expression control sequence such as the pMT2 or pED expression vectors disclosed in 
Kaufman et al., Nucleic Acids Res. 19, 4485-4490 (1991), in order to produce the protein 
recombinantly. Many suitable expression control sequences are known in the art. 

30 General methods of expressing recombinant proteins are also known and are exemplified 
in R. Kaufman, Methods in Enzymology 185, 537-566 (1990). As defined herein 
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"operably linked" means that the isolated polynucleotide of the invention and an 
expression control sequence are situated within a vector or cell in such a way that the 
protein is expressed by a host cell which has been transformed (transfected) with the 
ligated polynucleotide/expression control sequence. 
5 Promoter regions can be selected from any desired gene using CAT 

(chloramphenicol transferase) vectors or other vectors with selectable markers. Two 
appropriate vectors are pKK232-8 and pCM7. Particular named bacterial promoters 
include lad, lacZ, T3, T7, gpt, lambda PR, and trc. Eukaryotic promoters include CMV 
immediate early, HSV thymidine kinase, early and late SV40, LTRs from retrovirus, and 

10 mouse metallothionein-L Selection of the appropriate vector and promoter is well within 
the level of ordinary skill in the art. Generally, recombinant expression vectors will 
include origins of replication and selectable markers permitting transformation of the host 
cell, e.g., the ampicillin resistance gene of E. coli and S. cerevisiae TRP1 gene, and a 
promoter derived from a highly-expressed gene to direct transcription of a downstream 

15 structural sequence. Such promoters can be derived from operons encoding glycolytic 
enzymes such as 3-phosphoglycerate kinase (PGK), a-factor, acid phosphatase, or heat 
shock proteins, among others. The heterologous structural sequence is assembled in 
appropriate phase with translation initiation and termination sequences, and preferably, a 
leader sequence capable of directing secretion of translated protein into the periplasmic 

20 space or extracellular medium. Optionally, the heterologous sequence can encode a 
fusion protein including an amino terminal identification peptide imparting desired 
characteristics, e.g., stabilization or simplified purification of expressed recombinant 
product. Useful expression vectors for bacterial use are constructed by inserting a 
structural DNA sequence encoding a desired protein together with suitable translation 

25 initiation and termination signals in operable reading phase with a functional promoter. 
The vector will comprise one or more phenotypic selectable markers and an origin of 
replication to ensure maintenance of the vector and to, if desirable, provide amplification 
within the host. Suitable prokaryotic hosts for transformation include E. coli, Bacillus 
subtilis, Salmonella typhimurium and various species within the genera Pseudomonas, 

30 Streptomyces, and Staphylococcus, although others may also be employed as a matter of 
choice. 
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As a representative but non-limiting example, useful expression vectors for 
bacterial use can comprise a selectable marker and bacterial origin of replication derived 
from commercially available plasmids comprising genetic elements of the well known 
cloning vector pBR322 (ATCC 37017). Such commercial vectors include, for example, 
5 pKK223-3 (Pharmacia Fine Chemicals, Uppsala, Sweden) and GEM 1 (Promega 

Biotech, Madison, WI, USA). These pBR322 "backbone" sections are combined with an 
appropriate promoter and the structural sequence to be expressed. Following 
transformation of a suitable host strain and growth of the host strain to an appropriate cell 
density, the selected promoter is induced or derepressed by appropriate means (e.g., 

10 temperature shift or chemical induction) and cells are cultured for an additional period. 
Cells are typically harvested by centrifugation, disrupted by physical or chemical means, 
and the resulting crude extract retained for further purification. 

Polynucleotides of the invention can also be used to induce immune responses. 
For example, as described in Fan et al, Nat. Biotech. 17:870-872 (1999), incorporated 

1 5 herein by reference, nucleic acid sequences encoding a polypeptide may be used to 

generate antibodies against the encoded polypeptide following topical administration of 
naked plasmid DNA or following injection, and preferably intramuscular injection of the 
DNA. The nucleic acid sequences are preferably inserted in a recombinant expression 
vector and may be in the form of naked DNA. 

20 

4.3 HOSTS 

The present invention further provides host cells genetically engineered to contain 
the polynucleotides of the invention. For example, such host cells may contain nucleic 
acids of the invention introduced into the host cell using known transformation, 

25 transfection or infection methods. The present invention still further provides host cells 
genetically engineered to express the polynucleotides of the invention, wherein such 
polynucleotides are in operative association with a regulatory sequence heterologous to 
the host cell which drives expression of the polynucleotides in the cell. 

Knowledge of nucleic acid sequences allows for modification of cells to permit, 

30 or increase, expression of endogenous polypeptide. Cells can be modified (e.g., by 

homologous recombination) to provide increased polypeptide expression by replacing, in 
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whole or in part, the naturally occurring promoter with all or part of a heterologous 
promoter so that the cells express the polypeptide at higher levels. The heterologous 
promoter is inserted in such a manner that it is operatively linked to the encoding 
sequences. See, for example, PCT International Publication No. WO94/12650, PCT 
5 International Publication No. WO92/20808, and PCT International Publication No. 
WO91/09955. It is also contemplated that, in addition to heterologous promoter DNA, 
amplifiable marker DNA (e.g., ada, dhfr, and the multifunctional CAD gene which 
encodes carbamyl phosphate synthase, aspartate transcarbamylase, and dihydroorotase) 
and/or intron DNA may be inserted along with the heterologous promoter DNA. If 

10 linked to the coding sequence, amplification of the marker DNA by standard selection 
methods results in co-amplification of the desired protein coding sequences in the cells. 

The host cell can be a higher eukaryotic host cell, such as a mammalian cell, a 
lower eukaryotic host cell, such as a yeast cell, or the host cell can be a prokaryotic cell, 
such as a bacterial cell. Introduction of the recombinant construct into the host cell can 

15 be effected by calcium phosphate transfection, DEAE, dextran mediated transfection, or 
electroporation (Davis, L, et al, Basic Methods in Molecular Biology (1986)). The host 
cells containing one of the polynucleotides of the invention, can be used in conventional 
manners to produce the gene product encoded by the isolated fragment (in the case of an 
ORF) or can be used to produce a heterologous protein under the control of the EMF. 

20 Any host/vector system can be used to express one or more of the ORFs of the 

present invention. These include, but are not limited to, eukaryotic hosts such as HeLa 
cells, Cv-1 cell, COS cells, 293 cells, and Sf9 cells, as well as prokaryotic host such as E. 
coli and B. subtilis. The most preferred cells are those which do not normally express the 
particular polypeptide or protein or which expresses the polypeptide or protein at low 

25 natural level. Mature proteins can be expressed in mammalian cells, yeast, bacteria, or 
other cells under the control of appropriate promoters. Cell-free translation systems can 
also be employed to produce such proteins using RNAs derived from the DNA constructs 
of the present invention. Appropriate cloning and expression vectors for use with 
prokaryotic and eukaryotic hosts are described by Sambrook, et al., in Molecular 

30 Cloning: A Laboratory Manual, Second Edition, Cold Spring Harbor, New York (1989), 
the disclosure of which is hereby incorporated by reference. 
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Various mammalian cell culture systems can also be employed to express 
recombinant protein. Examples of mammalian expression systems include the COS-7 
lines of monkey kidney fibroblasts, described by Gluzman, Cell 23:175 (1981). Other 
cell lines capable of expressing a compatible vector are, for example, the CI 27, monkey 
5 COS cells, Chinese Hamster Ovary (CHO) cells, human kidney 293 cells, human 
epidermal A431 cells, human Colo205 cells, 3T3 cells, CV-1 cells, other transformed 
primate cell lines, normal diploid cells, cell strains derived from in vitro culture of 
primary tissue, primary explants, HeLa cells, mouse L cells, BHK, HL-60, U937, HaK or 
Jurkat cells. Mammalian expression vectors will comprise an origin of replication, a 

1 0 suitable promoter and also any necessary ribosome binding sites, polyadenylation site, 
splice donor and acceptor sites, transcriptional termination sequences, and 5 ' flanking 
nontranscribed sequences. DNA sequences derived from the SV40 viral genome, for 
example, SV40 origin, early promoter, enhancer, splice, and polyadenylation sites may be 
used to provide the required nontranscribed genetic elements. Recombinant polypeptides 

1 5 and proteins produced in bacterial culture are usually isolated by initial extraction from 
cell pellets, followed by one or more salting-out, aqueous ion exchange or size exclusion 
chromatography steps. Protein refolding steps can be used, as necessary, in completing 
configuration of the mature protein. Finally, high performance liquid chromatography 
(HPLC) can be employed for final purification steps. Microbial cells employed in 

20 expression of proteins can be disrupted by any convenient method, including freeze-thaw 
cycling, sonication, mechanical disruption, or use of cell lysing agents. 

Alternatively, it may be possible to produce the protein in lower eukaryotes such 
as yeast or insects or in prokaryotes such as bacteria. Potentially suitable yeast strains 
include Saccharomyces cerevisiae, Schizosaccharomyces pombe, Kluyveromyces strains, 

25 Candida, or any yeast strain capable of expressing heterologous proteins. Potentially 
suitable bacterial strains include Escherichia coli, Bacillus subtilis, Salmonella 
typhimurium, or any bacterial strain capable of expressing heterologous proteins. If the 
protein is made in yeast or bacteria, it may be necessary to modify the protein produced 
therein, for example by phosphorylation or glycosylation of the appropriate sites, in order 

30 to obtain the functional protein. Such covalent attachments may be accomplished using 
known chemical or enzymatic methods. 
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In another embodiment of the present invention, cells and tissues may be 
engineered to express an endogenous gene comprising the polynucleotides of the 
invention under the control of inducible regulatory elements, in which case the regulatory 
sequences of the endogenous gene may be replaced by homologous recombination. As 
5 described herein, gene targeting can be used to replace a gene's existing regulatory region 
with a regulatory sequence isolated from a different gene or a novel regulatory sequence 
synthesized by genetic engineering methods. Such regulatory sequences may be 
comprised of promoters, enhancers, scaffold-attachment regions, negative regulatory 
elements, transcriptional initiation sites, regulatory protein binding sites or combinations 

10 of said sequences. Alternatively, sequences which affect the structure or stability of the 
RNA or protein produced may be replaced, removed, added, or otherwise modified by 
targeting. These sequence include polyadenylation signals, mRNA stability elements, 
splice sites, leader sequences for enhancing or modifying transport or secretion properties 
of the protein, or other sequences which alter or improve the function or stability of 

1 5 protein or RNA molecules. 

The targeting event may be a simple insertion of the regulatory sequence, placing 
the gene under the control of the new regulatory sequence, e.g., inserting a new promoter 
or enhancer or both upstream of a gene. Alternatively, the targeting event may be a 
simple deletion of a regulatory element, such as the deletion of a tissue-specific negative 

20 regulatory element. Alternatively, the targeting event may replace an existing element; 
for example, a tissue-specific enhancer can be replaced by an enhancer that has broader 
or different cell-type specificity than the naturally occurring elements. Here, the 
naturally occurring sequences are deleted and new sequences are added. In all cases, the 
identification of the targeting event may be facilitated by the use of one or more 

25 selectable marker genes that are contiguous with the targeting DNA, allowing for the 
selection of cells in which the exogenous DNA has integrated into the host cell genome. 
The identification of the targeting event may also be facilitated by the use of one or more 
marker genes exhibiting the property of negative selection, such that the negatively 
selectable marker is linked to the exogenous DNA, but configured such that the 

30 negatively selectable marker flanks the targeting sequence, and such that a correct 

homologous recombination event with sequences in the host cell genome does not result 
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in the stable integration of the negatively selectable marker. Markers useful for this 
purpose include the Herpes Simplex Virus thymidine kinase (TK) gene or the bacterial 
xanthine-guanine phosphoribosyl-transferase (gpt) gene. 

The gene targeting or gene activation techniques which can be used in accordance 
with this aspect of the invention are more particularly described in U.S. Patent No. 
5,272,071 to Chappel; U.S. Patent No. 5,578,461 to Sherwin et al; International 
Application No. PCT/US 92/09627 (WO93/09222) by Selden et al; and International 
Application No. PCT/US90/06436 (WO91/06667) by Skoultchi et al, each of which is 
incorporated by reference herein in its entirety. 

4.4 POLYPEPTIDES OF THE INVENTION 

The isolated polypeptides of the invention include, but are not limited to, a 
polypeptide comprising: the amino acid sequences set forth as any one of SEQ ED NO: 1- 
35 or an amino acid sequence encoded by any one of the nucleotide sequences SEQ ED 
NOs: 1 - 35 or the corresponding full length or mature protein. Polypeptides of the 
invention also include polypeptides preferably with biological or immunological activity 
that are encoded by: (a) a polynucleotide having any one of the nucleotide sequences set 
forth in the SEQ ID NOs: 1 - 35 or (b) polynucleotides encoding any one of the amino 
acid sequences set forth as SEQ ED NO: 1-35 or (c) polynucleotides that hybridize to the 
complement of the polynucleotides of either (a) or (b) under stringent hybridization 
conditions. The invention also provides biologically active or immunologically active 
variants of any of the amino acid sequences set forth as SEQ ID NO: 1-35 or the 
corresponding full length or mature protein; and "substantial equivalents" thereof (e.g., 
with at least about 65%, at least about 70%, at least about 75%, at least about 80%, at 
least about 85%, at least about 90%, typically at least about 95%, more typically at least 
about 98%, or most typically at least about 99% amino acid identity) that retain 
biological activity. Polypeptides encoded by allelic variants may have a similar, 
increased, or decreased activity compared to polypeptides comprising SEQ ID NO: 1-35. 

Fragments of the proteins of the present invention which are capable of exhibiting 
biological activity are also encompassed by the present invention. Fragments of the 
protein may be in linear form or they may be cyclized using known methods, for 
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example, as described in H. U. Saragovi, et al., Bio/Technology 10, 773-778 (1992) and 
in R. S. McDowell, et aL, J. Amer. Chem. Soc. 1 14, 9245-9253 (1992), both of which are 
incorporated herein by reference. Such fragments may be fused to carrier molecules such 
as immunoglobulins for many purposes, including increasing the valency of protein 
5 binding sites. 

The present invention also provides both full-length and mature forms (for 
example, without a signal sequence or precursor sequence) of the disclosed proteins. The 
protein coding sequence is identified in the sequence listing by translation of the 
disclosed nucleotide sequences. The mature form of such protein may be obtained by 

10 expression of a full-length polynucleotide in a suitable mammalian cell or other host cell. 
The sequence of the mature form of the protein is also determinable from the amino acid 
sequence of the full-length form. Where proteins of the present invention are membrane 
bound, soluble forms of the proteins are also provided. In such forms, part or all of the 
regions causing the proteins to be membrane bound are deleted so that the proteins are 

1 5 fully secreted from the cell in which it is expressed. 

Protein compositions of the present invention may further comprise an acceptable 
carrier, such as a hydrophilic, e.g., pharmaceutical^ acceptable, carrier. 

The present invention further provides isolated polypeptides encoded by the 
nucleic acid fragments of the present invention or by degenerate variants of the nucleic 

20 acid fragments of the present invention. By "degenerate variant" is intended nucleotide 
fragments which differ from a nucleic acid fragment of the present invention (e.g. , an 
ORF) by nucleotide sequence but, due to the degeneracy of the genetic code, encode an 
identical polypeptide sequence. Preferred nucleic acid fragments of the present invention 
are the ORFs that encode proteins. 

25 A variety of methodologies known in the art can be utilized to obtain any one of 

the isolated polypeptides or proteins of the present invention. At the simplest level, the 
amino acid sequence can be synthesized using commercially available peptide 
synthesizers. The synthetically-constructed protein sequences, by virtue of sharing 
primary, secondary or tertiary structural and/or conformational characteristics with 

30 proteins may possess biological properties in common therewith, including protein 

activity. This technique is particularly useful in producing small peptides and fragments 
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of larger polypeptides. Fragments are useful, for example, in generating antibodies 
against the native polypeptide. Thus, they may be employed as biologically active or 
immunological substitutes for natural, purified proteins in screening of therapeutic 
compounds and in immunological processes for the development of antibodies. 
5 The polypeptides and proteins of the present invention can alternatively be 

purified from cells which have been altered to express the desired polypeptide or protein. 
As used herein, a cell is said to be altered to express a desired polypeptide or protein 
when the cell, through genetic manipulation, is made to produce a polypeptide or protein 
which it normally does not produce or which the cell normally produces at a lower level. 
10 One skilled in the art can readily adapt procedures for introducing and expressing either 
recombinant or synthetic sequences into eukaryotic or prokaryotic cells in order to 
generate a cell which produces one of the polypeptides or proteins of the present 
invention. 

The invention also relates to methods for producing a polypeptide comprising 

1 5 growing a culture of host cells of the invention in a suitable culture medium, and 
purifying the protein from the cells or the culture in which the cells are grown. For 
example, the methods of the invention include a process for producing a polypeptide in 
which a host cell containing a suitable expression vector that includes a polynucleotide of 
the invention is cultured under conditions that allow expression of the encoded 

20 polypeptide. The polypeptide can be recovered from the culture, conveniently from the 
culture medium, or from a lysate prepared from the host cells and further purified. 
Preferred embodiments include those in which the protein produced by such process is a 
full length or mature form of the protein. 

In an alternative method, the polypeptide or protein is purified from bacterial 

25 cells which naturally produce the polypeptide or protein. One skilled in the art can 

readily follow known methods for isolating polypeptides and proteins in order to obtain 
one of the isolated polypeptides or proteins of the present invention. These include, but 
are not limited to, immunochromatography, HPLC, size-exclusion chromatography, 
ion-exchange chromatography, and immuno-affinity chromatography. See, e.g., Scopes, 

30 Protein Purification: Principles and Practice, Springer- Verlag (1994); Sambrook, et al., 
in Molecular Cloning: A Laboratory Manual; Ausubel et al., Current Protocols in 
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Molecular Biology, Polypeptide fragments that retain biological/immuno logical activity 
include fragments comprising greater than about 100 amino acids, or greater than about 
200 amino acids, and fragments that encode specific protein domains. 

The purified polypeptides can be used in in vitro binding assays which are well 
5 known in the art to identify molecules which bind to the polypeptides. These molecules 
include but are not limited to, for e.g., small molecules, molecules from combinatorial 
libraries, antibodies or other proteins. The molecules identified in the binding assay are 
then tested for antagonist or agonist activity in in vivo tissue culture or animal models 
that are well known in the art. In brief, the molecules are titrated into a plurality of cell 
1 0 cultures or animals and then tested for either cell/animal death or prolonged survival of 
the animal/cells. 

In addition, the peptides of the invention or molecules capable of binding to the 
peptides may be complexed with toxins, e.g., ricin or cholera, or with other compounds 
that are toxic to cells. The toxin-binding molecule complex is then targeted to a tumor or 
15 other cell by the specificity of the binding molecule for SEQ ED NO: 1-35. 

The protein of the invention may also be expressed as a product of transgenic 
animals, e.g., as a component of the milk of transgenic cows, goats, pigs, or sheep which 
are characterized by somatic or germ cells containing a nucleotide sequence encoding the 
protein. 

20 The proteins provided herein also include proteins characterized by amino acid 

sequences similar to those of purified proteins but into which modification are naturally 
provided or deliberately engineered. For example, modifications, in the peptide or DNA 
sequence, can be made by those skilled in the art using known techniques. Modifications 
of interest in the protein sequences may include the alteration, substitution, replacement, 

25 insertion or deletion of a selected amino acid residue in the coding sequence. For 

example, one or more of the cysteine residues may be deleted or replaced with another 
amino acid to alter the conformation of the molecule. Techniques for such alteration, 
substitution, replacement, insertion or deletion are well known to those skilled in the art 
(see, e.g., U.S. Pat. No. 4,518,584). Preferably, such alteration, substitution, replacement, 

30 insertion or deletion retains the desired activity of the protein. Regions of the protein that 
are important for the protein function can be determined by various methods known in 
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the art including the alanine-scanning method which involved systematic substitution of 
single or strings of amino acids with alanine, followed by testing the resulting 
alanine-containing variant for biological activity. This type of analysis determines the 
importance of the substituted amino acid(s) in biological activity. Regions of the protein 
5 that are important for protein function may be determined by the eMATRIX program. 

Other fragments and derivatives of the sequences of proteins which would be 
expected to retain protein activity in whole or in part and are useful for screening or other 
immunological methodologies may also be easily made by those skilled in the art given 
the disclosures herein. Such modifications are encompassed by the present invention. 

1 0 The protein may also be produced by operably linking the isolated polynucleotide 

of the invention to suitable control sequences in one or more insect expression vectors, 
and employing an insect expression system. Materials and methods for 
baculovirus/insect cell expression systems are commercially available in kit form from, 
e.g., Invitrogen, San Diego, Calif., U.S.A. (the MaxBat™ kit), and such methods are well 

15 known in the art, as described in Summers and Smith, Texas Agricultural Experiment 
Station Bulletin No. 1555 (1987), incorporated herein by reference. As used herein, an 
insect cell capable of expressing a polynucleotide of the present invention is 
"transformed." 

The protein of the invention may be prepared by culturing transformed host cells 
20 under culture conditions suitable to express the recombinant protein. The resulting 

expressed protein may then be purified from such culture (i.e., from culture medium or 
cell extracts) using known purification processes, such as gel filtration and ion exchange 
chromatography. The purification of the protein may also include an affinity column 
containing agents which will bind to the protein; one or more column steps over such 
25 affinity resins as concanavalin A-agarose, heparin-toyopearl™ or Cibacrom blue 3GA 
Sepharose™; one or more steps involving hydrophobic interaction chromatography using 
such resins as phenyl ether, butyl ether, or propyl ether; or immunoaffinity 
chromatography. 

Alternatively, the protein of the invention may also be expressed in a form which 
30 will facilitate purification. For example, it may be expressed as a fusion protein, such as 
those of maltose binding protein (MBP), glutathione-S-transferase (GST) or thioredoxin 
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(TRX), or as a His tag. Kits for expression and purification of such fusion proteins are 
commercially available from New England BioLab (Beverly, Mass.), Pharmacia 
(Piscataway, NJ.) and Invitrogen, respectively. The protein can also be tagged with an 
epitope and subsequently purified by using a specific antibody directed to such epitope. 
5 One such epitope ("FLAG®") is commercially available from Kodak (New Haven, 
Conn.). 

Finally, one or more reverse-phase high performance liquid chromatography (RP- 
HPLC) steps employing hydrophobic RP-HPLC media, e.g., silica gel having pendant 
methyl or other aliphatic groups, can be employed to further purify the protein. Some or 

10 all of the foregoing purification steps, in various combinations, can also be employed to 
provide a substantially homogeneous isolated recombinant protein. The protein thus 
purified is substantially free of other mammalian proteins and is defined in accordance 
with the present invention as an "isolated protein." 

The polypeptides of the invention include analogs (variants). This embraces 

15 fragments, as well as peptides in which one or more amino acids has been deleted, 
inserted, or substituted. Also, analogs of the polypeptides of the invention embrace 
fusions of the polypeptides or modifications of the polypeptides of the invention, wherein 
the polypeptide or analog is fused to another moiety or moieties, e.g., targeting moiety or 
another therapeutic agent. Such analogs may exhibit improved properties such as activity 

20 and/or stability. Examples of moieties which may be fused to the polypeptide or an 
analog include, for example, targeting moieties which provide for the delivery of 
polypeptide to pancreatic cells, e.g., antibodies to pancreatic cells, antibodies to immune 
cells such as T-cells, monocytes, dendritic cells, granulocytes, etc., as well as receptor 
and ligands expressed on pancreatic or immune cells. Other moieties which may be 

25 fused to the polypeptide include therapeutic agents which are used for treatment, for 
example, immunosuppressive drugs such as cyclosporin, SK506, azathioprine, CD3 
antibodies and steroids. Also, polypeptides may be fused to immune modulators, and 
other cytokines such as alpha or beta interferon. 

30 4.4.1 DETERMINING POLYPEPTIDE AND POLYNUCLEOTIDE 

IDENTITY AND SIMILARITY 
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Preferred identity and/or similarity are designed to give the largest match 
between the sequences tested. Methods to determine identity and similarity are codified 
in computer programs including, but are not limited to, the GCG program package, 
including GAP (Devereux, J., et al. Nucleic Acids Research 12(1):387 (1984); Genetics 
5 Computer Group, University of Wisconsin, Madison, WI), BLASTP, BLASTN, 

BLASTX, FASTA (Altschul, S.R et al, J. Molec. Biol. 215:403-410 (1990), PSI-BLAST 
(Altschul S.F. et al., Nucleic Acids Res. vol. 25, pp. 3389-3402, herein incorporated by 
reference), eMatrix software (Wu et al, J. Comp. Biol., Vol. 6, pp. 219-235 (1999), 
herein incorporated by reference), eMotif software (Nevill-Manning et al, ISMB-97, Vol. 

10 4, pp. 202-209, herein incorporated by reference), pFam software (Sonnhammer et al., 
Nucleic Acids Res., Vol. 26(1), pp. 320-322 (1998), herein incorporated by reference) 
and the Kyte-Doolittle hydrophobocity prediction algorithm (J. Mol Biol, 157, pp. 105-31 
(1982), incorporated herein by reference). The BLAST programs are publicly available 
from the National Center for Biotechnology Information (NCBI) and other sources 

15 (BLAST Manual, Altschul, S., et al. NCB NLM NIH Bethesda, MD 20894; Altschul, S., 
et al, J. Mol. Biol. 215:403-410 (1990). 

4.5 GENE THERAPY 

Mutations in the polynucleotides of the invention gene may result in loss of 
20 normal function of the encoded protein. The invention thus provides gene therapy to 
restore normal activity of the polypeptides of the invention; or to treat disease states 
involving polypeptides of the invention. Delivery of a functional gene encoding 
polypeptides of the invention to appropriate cells is effected ex vivo, in situ, or in vivo by 
use of vectors, and more particularly viral vectors (e.g., adenovirus, adeno-associated 
25 virus, or a retrovirus), or ex vivo by use of physical DNA transfer methods (e.g., 

liposomes or chemical treatments). See, for example, Anderson, Nature, supplement to 
vol. 392, no. 6679, pp.25-20 (1998). For additional reviews of gene therapy technology 
see Friedmann, Science, 244: 1275-1281 (1989); Verma, Scientific American: 68-84 
(1990); and Miller, Nature, 357: 455-460 (1992). Introduction of any one of the 
30 nucleotides of the present invention or a gene encoding the polypeptides of the present 
invention can also be accomplished with extrachromosomal substrates (transient 



789CIP2C 



34 



expression) or artificial chromosomes (stable expression). Cells may also be cultured ex 
vivo in the presence of proteins of the present invention in order to proliferate or to 
produce a desired effect on or activity in such cells. Treated cells can then be introduced 
in vivo for therapeutic purposes. Alternatively, it is contemplated that in other human 
5 disease states, preventing the expression of or inhibiting the activity of polypeptides of 
the invention will be useful in treating the disease states. It is contemplated that antisense 
therapy or gene therapy could be applied to negatively regulate the expression of 
polypeptides of the invention. 

Other methods inhibiting expression of a protein include the introduction of 

1 0 antisense molecules to the nucleic acids of the present invention, their complements, or their 
translated RNA sequences, by methods known in the art. Further, the polypeptides of the 
present invention can be inhibited by using targeted deletion methods, or the insertion of a 
negative regulatory element such as a silencer, which is tissue specific. 

The present invention still further provides cells genetically engineered in vivo to 

1 5 express the polynucleotides of the invention, wherein such polynucleotides are in operative 
association with a regulatory sequence heterologous to the host cell which drives expression 
of the polynucleotides in the cell. These methods can be used to increase or decrease the 
expression of the polynucleotides of the present invention. 

Knowledge of DNA sequences provided by the invention allows for modification of 

20 cells to permit, increase, or decrease, expression of endogenous polypeptide. Cells can be 
modified (e.g., by homologous recombination) to provide increased polypeptide expression 
by replacing, in whole or in part, the naturally occurring promoter with all or part of a 
heterologous promoter so that the cells express the protein at higher levels. The heterologous 
promoter is inserted in such a manner that it is operatively linked to the desired protein 

25 encoding sequences. See, for example, PCT International Publication No. WO 94/12650, 
PCT International Publication No. WO 92/20808, and PCT International Publication No. 
WO 91/09955. It is also contemplated that, in addition to heterologous promoter DNA, 
amplifiable marker DNA (e.g., ada, dhfr, and the multifunctional CAD gene which encodes 
carbamyl phosphate synthase, aspartate transcarbamylase, and dihydroorotase) and/or intron 

30 DNA may be inserted along with the heterologous promoter DNA. If linked to the desired 
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protein coding sequence, amplification of the marker DNA by standard selection methods 
results in co-amplification of the desired protein coding sequences in the cells. 

In another embodiment of the present invention, cells and tissues may be engineered 
to express an endogenous gene comprising the polynucleotides of the invention under the 
5 control of inducible regulatory elements, in which case the regulatory sequences of the 
endogenous gene may be replaced by homologous recombination. As described herein, 
gene targeting can be used to replace a gene's existing regulatory region with a regulatory 
sequence isolated from a different gene or a novel regulatory sequence synthesized by 
genetic engineering methods. Such regulatory sequences may be comprised of promoters, 

10 enhancers, scaffold-attachment regions, negative regulatory elements, transcriptional 
initiation sites, regulatory protein binding sites or combinations of said sequences. 
Alternatively, sequences which affect the structure or stability of the RNA or protein 
produced may be replaced, removed, added, or otherwise modified by targeting. These 
sequences include polyadenylation signals, mRNA stability elements, splice sites, leader 

1 5 sequences for enhancing or modifying transport or secretion properties of the protein, or 
other sequences which alter or improve the function or stability of protein or RNA 
molecules. 

The targeting event may be a simple insertion of the regulatory sequence, placing the 
gene under the control of the new regulatory sequence, e.g., inserting a new promoter or 

20 enhancer or both upstream of a gene. Alternatively, the targeting event may be a simple 

deletion of a regulatory element, such as the deletion of a tissue-specific negative regulatory 
element. Alternatively, the targeting event may replace an existing element; for example, a 
tissue-specific enhancer can be replaced by an enhancer that has broader or different 
cell-type specificity than the naturally occurring elements. Here, the naturally occurring 

25 sequences are deleted and new sequences are added. In all cases, the identification of the 
targeting event may be facilitated by the use of one or more selectable marker genes that are 
contiguous with the targeting DNA, allowing for the selection of cells in which the 
exogenous DNA has integrated into the cell genome. The identification of the targeting 
event may also be facilitated by the use of one or more marker genes exhibiting the property 

30 of negative selection, such that the negatively selectable marker is linked to the exogenous 
DNA, but configured such that the negatively selectable marker flanks the targeting 
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sequence, and such that a correct homologous recombination event with sequences in the 
host cell genome does not result in the stable integration of the negatively selectable marker. 
Markers useful for this purpose include the Herpes Simplex Virus thymidine kinase (TK) 
gene or the bacterial xanthine-guanine phosphoribosyl-transferase (gpt) gene, 
5 The gene targeting or gene activation techniques which can be used in accordance 

with this aspect of the invention are more particularly described in U.S. Patent No. 
5,272,071 to Chappel; U.S. Patent No. 5,578,461 to Sherwin et al; International Application 
No. PCT/US92/09627 (WO93/09222) by Selden et al; and International Application No. 
PCT/US90/06436 (W09 1/06667) by Skoultchi et al, each of which is incorporated by 
1 0 reference herein in its entirety. 

4.6 TRANSGENIC ANIMALS 

In preferred methods to determine biological functions of the polypeptides of the 
invention in vivo, one or more genes provided by the invention are either over expressed 

1 5 or inactivated in the germ line of animals using homologous recombination [Capecchi, 
Science 244:1288-1292 (1989)]. Animals in which the gene is over expressed, under the 
regulatory control of exogenous or endogenous promoter elements, are known as 
transgenic animals. Animals in which an endogenous gene has been inactivated by 
homologous recombination are referred to as "knockout" animals. Knockout animals, 

20 preferably non-human mammals, can be prepared as described in U.S. Patent No. 

5,557,032, incorporated herein by reference. Transgenic animals are useful to determine 
the roles polypeptides of the invention play in biological processes, and preferably in 
disease states. Transgenic animals are useful as model systems to identify compounds 
that modulate lipid metabolism. Transgenic animals, preferably non-human mammals, 

25 are produced using methods as described in U.S. Patent No 5,489,743 and PCT 
Publication No. W094/28122, incorporated herein by reference. 

Transgenic animals can be prepared wherein all or part of a promoter of the 
polynucleotides of the invention is either activated or inactivated to alter the level of 
expression of the polypeptides of the invention. Inactivation can be carried out using 

30 homologous recombination methods described above. Activation can be achieved by 
supplementing or even replacing the homologous promoter to provide for increased 
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protein expression. The homologous promoter can be supplemented by insertion of one 
or more heterologous enhancer elements known to confer promoter activation in a 
particular tissue. 

The polynucleotides of the present invention also make possible the development, 
5 through, e.g., homologous recombination or knock out strategies, of animals that fail to 
express polypeptides of the invention or that express a variant polypeptide. Such animals 
are useful as models for studying the in vivo activities of polypeptide as well as for 
studying modulators of the polypeptides of the invention. 

In preferred methods to determine biological functions of the polypeptides of the 

10 invention in vivo, one or more genes provided by the invention are either over expressed 
or inactivated in the germ line of animals using homologous recombination [Capecchi, 
Science 244:1288-1292 (1989)]. Animals in which the gene is over expressed, under the 
regulatory control of exogenous or endogenous promoter elements, are known as 
transgenic animals. Animals in which an endogenous gene has been inactivated by 

15 homologous recombination are referred to as "knockout" animals. Knockout animals, 
preferably non-human mammals, can be prepared as described in U.S. Patent No. 
5,557,032, incorporated herein by reference. Transgenic animals are useful to determine 
the roles polypeptides of the invention play in biological processes, and preferably in 
disease states. Transgenic animals are useful as model systems to identify compounds 

20 that modulate lipid metabolism. Transgenic animals, preferably non-human mammals, 
are produced using methods as described in U.S. Patent No 5,489,743 and PCT 
Publication No. W094/28122, incorporated herein by reference. 

Transgenic animals can be prepared wherein all or part of the polynucleotides of 
the invention promoter is either activated or inactivated to alter the level of expression of 

25 the polypeptides of the invention. Inactivation can be carried out using homologous 

recombination methods described above. Activation can be achieved by supplementing or 
even replacing the homologous promoter to provide for increased protein expression. The 
homologous promoter can be supplemented by insertion of one or more heterologous 
enhancer elements known to confer promoter activation in a particular tissue. 

30 

4.7 USES AND BIOLOGICAL ACTIVITY 
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The polynucleotides and proteins of the present invention are expected to exhibit 
one or more of the uses or biological activities (including those associated with assays 
cited herein) identified herein. Uses or activities described for proteins of the present 
invention may be provided by administration or use of such proteins or of 
5 polynucleotides encoding such proteins (such as, for example, in gene therapies or 
vectors suitable for introduction of DNA). The mechanism underlying the particular 
condition or pathology will dictate whether the polypeptides of the invention, the 
polynucleotides of the invention or modulators (activators or inhibitors) thereof would be 
beneficial to the subject in need of treatment. Thus, "therapeutic compositions of the 

10 invention" include compositions comprising isolated polynucleotides (including 
recombinant DNA molecules, cloned genes and degenerate variants thereof) or 
polypeptides of the invention (including full length protein, mature protein and 
truncations or domains thereof), or compounds and other substances that modulate the 
overall activity of the target gene products, either at the level of target gene/protein 

15 expression or target protein activity. Such modulators include polypeptides, analogs, 

(variants), including fragments and fusion proteins, antibodies and other binding proteins; 
chemical compounds that directly or indirectly activate or inhibit the polypeptides of the 
invention (identified, e.g., via drug screening assays as described herein); antisense 
polynucleotides and polynucleotides suitable for triple helix formation; and in particular 

20 antibodies or other binding partners that specifically recognize one or more epitopes of 
the polypeptides of the invention. 

The polypeptides of the present invention may likewise be involved in cellular 
activation or in one of the other physiological pathways described herein. 



25 4.7,1 RESEARCH USES AND UTILITIES 

The polynucleotides provided by the present invention can be used by the 
research community for various purposes. The polynucleotides can be used to express 
recombinant protein for analysis, characterization or therapeutic use; as markers for 
tissues in which the corresponding protein is preferentially expressed (either 
30 constitutively or at a particular stage of tissue differentiation or development or in disease 
states); as molecular weight markers on gels; as chromosome markers or tags (when 



789CEP2C 



39 



labeled) to identify chromosomes or to map related gene positions; to compare with 
endogenous DNA sequences in patients to identify potential genetic disorders; as probes 
to hybridize and thus discover novel, related DNA sequences; as a source of information 
to derive PCR primers for genetic fingerprinting; as a probe to "subtract-out" known 
5 sequences in the process of discovering other novel polynucleotides; for selecting and 
making oligomers for attachment to a "gene chip" or other support, including for 
examination of expression patterns; to raise anti-protein antibodies using DNA 
immunization techniques; and as an antigen to raise anti-DNA antibodies or elicit another 
immune response. Where the polynucleotide encodes a protein which binds or 

1 0 potentially binds to another protein (such as, for example, in a receptor-ligand 

interaction), the polynucleotide can also be used in interaction trap assays (such as, for 
example, that described in Gyuris et al., Cell 75:791-803 (1993)) to identify 
polynucleotides encoding the other protein with which binding occurs or to identify 
inhibitors of the binding interaction. 

1 5 The polypeptides provided by the present invention can similarly be used in 

assays to determine biological activity, including in a panel of multiple proteins for 
high-throughput screening; to raise antibodies or to elicit another immune response; as a 
reagent (including the labeled reagent) in assays designed to quantitatively determine 
levels of the protein (or its receptor) in biological fluids; as markers for tissues in which 

20 the corresponding polypeptide is preferentially expressed (either constitutively or at a 
particular stage of tissue differentiation or development or in a disease state); and, of 
course, to isolate correlative receptors or ligands. Proteins involved in these binding 
interactions can also be used to screen for peptide or small molecule inhibitors or agonists 
of the binding interaction. 

25 Any or all of these research utilities are capable of being developed into reagent 

grade or kit format for commercialization as research products. 

Methods for performing the uses listed above are well known to those skilled in 
the art. References disclosing such methods include without limitation "Molecular 
Cloning: A Laboratory Manual", 2d ed., Cold Spring Harbor Laboratory Press, 

30 Sambrook, L, E. F. Fritsch and T. Maniatis eds., 1989, and "Methods in Enzymology: 
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Guide to Molecular Cloning Techniques" , Academic Press, Berger, S. L. and A. R. 
Kimmel eds., 1987. 

4.7.2 NUTRITIONAL USES 

5 Polynucleotides and polypeptides of the present invention can also be used as 

nutritional sources or supplements. Such uses include without limitation use as a protein or 
amino acid supplement, use as a carbon source, use as a nitrogen source and use as a source 
of carbohydrate. In such cases the polypeptide or polynucleotide of the invention can be 
added to the feed of a particular organism or can be administered as a separate solid or liquid 
10 preparation, such as in the form of powder, pills, solutions, suspensions or capsules. In the 
case of microorganisms, the polypeptide or polynucleotide of the invention can be added to 
the medium in or on which the microorganism is cultured. 

4.7.3 CYTOKINE AND CELL PROLIFERATION/DIFFERENTIATION 
15 ACTIVITY 

A polypeptide of the present invention may exhibit activity relating to cytokine, 
cell proliferation (either inducing or inhibiting) or cell differentiation (either inducing or 
inhibiting) activity or may induce production of other cytokines in certain cell 
populations. A polynucleotide of the invention can encode a polypeptide exhibiting such 

20 attributes. Many protein factors discovered to date, including all known cytokines, have 
exhibited activity in one or more factor-dependent cell proliferation assays, and hence the 
assays serve as a convenient confirmation of cytokine activity. The activity of therapeutic 
compositions of the present invention is evidenced by any one of a number of routine 
factor dependent cell proliferation assays for cell lines including, without limitation, 32D, 

25 DA2, DA1G, T10, B9, B9/1 1, BaF3, MC9/G, M+(preB M+), 2E8, RB5, DAI, 123, 

Tl 165, HT2, CTLL2, TF-1, Mo7e, CMK, HUVEC, and Caco. Therapeutic compositions 
of the invention can be used in the following: 

Assays for T-cell or thymocyte proliferation include without limitation those 
described in: Current Protocols in Immunology, Ed by J. E. Coligan, A. M. Kruisbeek, D. 

30 H. Margulies, E. M. Shevach, W. Strober, Pub. Greene Publishing Associates and 

Wiley-Interscience (Chapter 3, In Vitro assays for Mouse Lymphocyte Function 3.1-3.19; 
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Chapter 7, Immunologic studies in Humans); Takai et al, J. Immunol. 137:3494-3500, 
1986; Bertagnolli et al., J. Immunol 145:1706-1712, 1990; Bertagnolli et al, Cellular 
Immunology 133:327-341, 1991; Bertagnolli, et al, L Immunol. 149:3778-3783, 1992; 
Bowman et al., I. Immunol. 152:1756-1761, 1994. 
5 Assays for cytokine production and/or proliferation of spleen cells, lymph node 

cells or thymocytes include, without limitation, those described in: Polyclonal T cell 
stimulation, Kruisbeek, A. M. and Shevach, E. M. In Current Protocols in Immunology. 
J. E. e.a. Coligan eds. Vol 1 pp. 3.12.1-3.12.14, John Wiley and Sons, Toronto. 1994; and 
Measurement of mouse and human interleukin- , Schreiber, R. D. In Current Protocols 
10 in Immunology. J. E. e.a. Coligan eds. Vol 1 pp. 6.8.1-6.8.8, John Wiley and Sons, 
Toronto. 1994. 

Assays for proliferation and differentiation of hematopoietic and lymphopoietic 
cells include, without limitation, those described in: Measurement of Human and Murine 
Interleukin 2 and Interleukin 4, Bottomly, K., Davis, L. S. and Lipsky, P. E. In Current 

15 Protocols in Immunology. J. E. e.a. Coligan eds. Vol 1 pp. 6,3.1-6.3.12, John Wiley and 
Sons, Toronto. 1991; deVries et al, J. Exp. Med. 173:1205-1211, 1991; Moreau et al., 
Nature 336:690-692, 1988; Greenberger et al., Proc. Natl. Acad. Sci. U.S.A. 
80:2931-2938, 1983; Measurement of mouse and human interleukin 6-Nordan, R. In 
Current Protocols in Immunology. J. E. Coligan eds. Vol 1 pp. 6.6.1-6.6,5, John Wiley 

20 and Sons, Toronto. 1991; Smith et al, Proc. Natl. Aced. Sci. U.S.A. 83:1857-1861, 1986; 
Measurement of human Interleukin 11 -Bennett, F., Giannotti, J., Clark, S. C. and Turner, 
K. J. In Current Protocols in Immunology. J. E. Coligan eds. Vol 1 pp. 6.15.1 John 
Wiley and Sons, Toronto. 1991; Measurement of mouse and human Interleukin 
9--Ciarletta, A., Giannotti, J., Clark, S. C. and Turner, K. J. In Current Protocols in 

25 Immunology. J. E. Coligan eds. Vol 1 pp. 6.13,1, John Wiley and Sons, Toronto. 1991. 

Assays for T-cell clone responses to antigens (which will identify, among others, 
proteins that affect APC-T cell interactions as well as direct T-cell effects by measuring 
proliferation and cytokine production) include, without limitation, those described in: 
Current Protocols in Immunology, Ed by J. E. Coligan, A. M. Kruisbeek, D. H. 

30 Margulies, E. M. Shevach, W Strober, Pub. Greene Publishing Associates and 

Wiley-Interscience (Chapter 3, In Vitro assays for Mouse Lymphocyte Function; Chapter 
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6, Cytokines and their cellular receptors; Chapter 7, Immunologic studies in Humans); 
Weinberger et aL, Proc. Natl Acad. Sci. USA 77:6091-6095, 1980; Weinberger et al 5 
Eur. J. hnmun. 11:405-411, 1981; Takai et aL, I Immunol. 137:3494-3500, 1986; Takai 
et aL, J. Immunol. 140:508-512, 1988. 

5 

4.7.4 STEM CELL GROWTH FACTOR ACTIVITY 

A polypeptide of the present invention may exhibit stem cell growth factor 
activity and be involved in the proliferation, differentiation and survival of pluripotent 
and totipotent stem cells including primordial germ cells, embryonic stem cells, 

10 hematopoietic stem cells and/or germ line stem cells. Administration of the polypeptide 
of the invention to stem cells in vivo or ex vivo is expected to maintain and expand cell 
populations in a totipotential or pluripotential state which would be useful for re- 
engineering damaged or diseased tissues, transplantation, manufacture of bio- 
pharmaceuticals and the development of bio-sensors. The ability to produce large 

1 5 quantities of human cells has important working applications for the production of human 
proteins which currently must be obtained from non-human sources or donors, 
implantation of cells to treat diseases such as Parkinson's, Alzheimer's and other 
neurodegenerative diseases; tissues for grafting such as bone marrow, skin, cartilage, 
tendons, bone, muscle (including cardiac muscle), blood vessels, cornea, neural cells, 

20 gastrointestinal cells and others; and organs for transplantation such as kidney, liver, 
pancreas (including islet cells), heart and lung. 

It is contemplated that multiple different exogenous growth factors and/or 
cytokines may be administered in combination with the polypeptide of the invention to 
achieve the desired effect, including any of the growth factors listed herein, other stem 

25 cell maintenance factors, and specifically including stem cell factor (SCF), leukemia 

inhibitory factor (LIF), Flt-3 ligand (Flt-3L), any of the interleukins, recombinant soluble 
IL-6 receptor fused to IL-6, macrophage inflammatory protein 1 -alpha (MIP-1-alpha), G- 
CSF, GM-CSF, thrombopoietin (TPO), platelet factor 4 (PF-4), platelet-derived growth 
factor (PDGF), neural growth factors and basic fibroblast growth factor (bFGF). 

30 Since totipotent stem cells can give rise to virtually any mature cell type, 

expansion of these cells in culture will facilitate the production of large quantities of 
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mature cells. Techniques for culturing stem cells are known in the art and administration 
of polypeptides of the invention, optionally with other growth factors and/or cytokines, is 
expected to enhance the survival and proliferation of the stem cell populations. This can 
be accomplished by direct administration of the polypeptide of the invention to the 
5 culture medium. Alternatively, stroma cells transfected with a polynucleotide that 
encodes for the polypeptide of the invention can be used as a feeder layer for the stem 
cell populations in culture or in vivo. Stromal support cells for feeder layers may include 
embryonic bone marrow fibroblasts, bone marrow stromal cells, fetal liver cells, or 
cultured embryonic fibroblasts (see U.S. Patent No. 5,690,926). 

10 Stem cells themselves can be transfected with a polynucleotide of the invention to 

induce autocrine expression of the polypeptide of the invention. This will allow for 
generation of undifferentiated totipotential/pluripotential stem cell lines that are useful as 
is or that can then be differentiated into the desired mature cell types. These stable cell 
lines can also serve as a source of undifferentiated totipotential/pluripotential mRNA to 

15 create cDNA libraries and templates for polymerase chain reaction experiments. These 
studies would allow for the isolation and identification of differentially expressed genes 
in stem cell populations that regulate stem cell proliferation and/or maintenance. 

Expansion and maintenance of totipotent stem cell populations will be useful in 
the treatment of many pathological conditions. For example, polypeptides of the present 

20 invention may be used to manipulate stem cells in culture to give rise to neuroepithelial 
cells that can be used to augment or replace cells damaged by illness, autoimmune 
disease, accidental damage or genetic disorders. The polypeptide of the invention may be 
useful for inducing the proliferation of neural cells and for the regeneration of nerve and 
brain tissue, i.e. for the treatment of central and peripheral nervous system diseases and 

25 neuropathies, as well as mechanical and traumatic disorders which involve degeneration, 
death or trauma to neural cells or nerve tissue. In addition, the expanded stem cell 
populations can also be genetically altered for gene therapy purposes and to decrease host 
rejection of replacement tissues after grafting or implantation. 

Expression of the polypeptide of the invention and its effect on stem cells can also 

30 be manipulated to achieve controlled differentiation of the stem cells into more 

differentiated cell types. A broadly applicable method of obtaining pure populations of a 
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specific differentiated cell type from undifferentiated stem cell populations involves the 
use of a cell-type specific promoter driving a selectable marker. The selectable marker 
allows only cells of the desired type to survive. For example, stem cells can be induced 
to differentiate into cardiomyocytes (Wobus et aL, Differentiation, 48: 173-182, (1991); 
5 Klug et al., J. Clin. Invest., 98(1): 216-224, (1998)) or skeletal muscle cells (Browder, L. 
W. In: Principles of Tissue Engineering eds. Lanza et aL, Academic Press (1997)). 
Alternatively, directed differentiation of stem cells can be accomplished by culturing the 
stem cells in the presence of a differentiation factor such as retinoic acid and an 
antagonist of the polypeptide of the invention which would inhibit the effects of 

10 endogenous stem cell factor activity and allow differentiation to proceed. 

In vitro cultures of stem cells can be used to determine if the polypeptide of the 
invention exhibits stem cell growth factor activity. Stem cells are isolated from any one 
of various cell sources (including hematopoietic stem cells and embryonic stem cells) and 
cultured on a feeder layer, as described by Thompson et aL Proc. Natl Acad. Sci, U.S.A., 

15 92: 7844-7848 (1995), in the presence of the polypeptide of the invention alone or in 

combination with other growth factors or cytokines. The ability of the polypeptide of the 
invention to induce stem cells proliferation is determined by colony formation on semi- 
solid support e.g. as described by Bernstein et al., Blood, 77: 2316-2321 (1991), 

20 4.7.5 HEMATOPOIESIS REGULATING ACTIVITY 

A polypeptide of the present invention may be involved in regulation of 
hematopoiesis and, consequently, in the treatment of myeloid or lymphoid cell disorders. 
Even marginal biological activity in support of colony forming cells or of 
factor-dependent cell lines indicates involvement in regulating hematopoiesis, e.g. in 

25 supporting the growth and proliferation of erythroid progenitor cells alone or in 

combination with other cytokines, thereby indicating utility, for example, in treating 
various anemias or for use in conjunction with irradiation/chemotherapy to stimulate the 
production of erythroid precursors and/or erythroid cells; in supporting the growth and 
proliferation of myeloid cells such as granulocytes and monocytes/macrophages (i.e., 

30 traditional CSF activity) useful, for example, in conjunction with chemotherapy to 

prevent or treat consequent myelo-suppression; in supporting the growth and proliferation 
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of megakaryocytes and consequently of platelets thereby allowing prevention or 
treatment of various platelet disorders such as thrombocytopenia, and generally for use in 
place of or complimentary to platelet transfusions; and/or in supporting the growth and 
proliferation of hematopoietic stem cells which are capable of maturing to any and all of 
5 the above-mentioned hematopoietic cells and therefore find therapeutic utility in various 
stem cell disorders (such as those usually treated with transplantation, including, without 
limitation, aplastic anemia and paroxysmal nocturnal hemoglobinuria), as well as in 
repopulating the stem cell compartment post irradiation/chemotherapy, either in-vivo or 
ex-vivo (i.e., in conjunction with bone marrow transplantation or with peripheral 
10 progenitor cell transplantation (homologous or heterologous)) as normal cells or 
genetically manipulated for gene therapy. 

Therapeutic compositions of the invention can be used in the following: 
Suitable assays for proliferation and differentiation of various hematopoietic lines 
are cited above. 

15 Assays for embryonic stem cell differentiation (which will identify, among others, 

proteins that influence embryonic differentiation hematopoiesis) include, without 
limitation, those described in: Johansson et al. Cellular Biology 15:141-151, 1995; Keller 
et al, Molecular and Cellular Biology 13:473-486, 1993; McClanahan et al, Blood 
81:2903-2915, 1993. 

20 Assays for stem cell survival and differentiation (which will identify, among 

others, proteins that regulate lympho-hematopoiesis) include, without limitation, those 
described in: Methylcellulose colony forming assays, Freshney, M. G. In Culture of 
Hematopoietic Cells. R. I. Freshney, et al. eds. Vol pp. 265-268, Wiley-Liss, Inc., New 
York, N.Y. 1994; Hirayama et al., Proc. Natl. Acad. Sci. USA 89:5907-591 1, 1992; 

25 Primitive hematopoietic colony forming cells with high proliferative potential, McNiece, 
I. K. and Briddell, R. A. In Culture of Hematopoietic Cells. R. I. Freshney, et al. eds. Vol 
pp. 23-39, Wiley-Liss, Inc., New York, N.Y. 1994; Neben et al, Experimental 
Hematology 22:353-359, 1994; Cobblestone area forming cell assay, Ploemacher, R. E. 
In Culture of Hematopoietic Cells. R. L Freshney, et al. eds. Vol pp. 1-21, Wiley-Liss, 

30 Inc., New York, N.Y. 1994; Long term bone marrow cultures in the presence of stromal 
cells, Spooncer, E., Dexter, M. and Allen, T. In Culture of Hematopoietic Cells. R. I. 
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Freshney, et al. eds. Vol pp. 163-179, Wiley-Liss, Inc., New York, N.Y. 1994; Long term 
culture initiating cell assay, Sutherland, H. J. In Culture of Hematopoietic Cells. R. I. 
Freshney, et al. eds. Vol pp. 139-162, Wiley-Liss, Inc., New York, NY. 1994. 

5 4.7,6 TISSUE GROWTH ACTIVITY 

A polypeptide of the present invention also may be involved in bone, cartilage, 
tendon, ligament and/or nerve tissue growth or regeneration, as well as in wound healing 
and tissue repair and replacement, and in healing of burns, incisions and ulcers. 

A polypeptide of the present invention which induces cartilage and/or bone 

1 0 growth in circumstances where bone is not normally formed, has application in the 

healing of bone fractures and cartilage damage or defects in humans and other animals. 
Compositions of a polypeptide, antibody, binding partner, or other modulator of the 
invention may have prophylactic use in closed as well as open fracture reduction and also 
in the improved fixation of artificial joints. De novo bone formation induced by an 

1 5 osteogenic agent contributes to the repair of congenital, trauma induced, or oncologic 
resection induced craniofacial defects, and also is useful in cosmetic plastic surgery. 

A polypeptide of this invention may also be involved in attracting bone- forming 
cells, stimulating growth of bone-forming cells, or inducing differentiation of progenitors 
of bone-forming cells. Treatment of osteoporosis, osteoarthritis, bone degenerative 

20 disorders, or periodontal disease, such as through stimulation of bone and/or cartilage 
repair or by blocking inflammation or processes of tissue destruction (collagenase 
activity, osteoclast activity, etc.) mediated by inflammatory processes may also be 
possible using the composition of the invention. 

Another category of tissue regeneration activity that may involve the polypeptide 

25 of the present invention is tendon/ligament formation. Induction of tendon/ligament-like 
tissue or other tissue formation in circumstances where such tissue is not normally 
formed, has application in the healing of tendon or ligament tears, deformities and other 
tendon or ligament defects in humans and other animals. Such a preparation employing a 
tendon/ligament-like tissue inducing protein may have prophylactic use in preventing 

30 damage to tendon or ligament tissue, as well as use in the improved fixation of tendon or 
ligament to bone or other tissues, and in repairing defects to tendon or ligament tissue. 
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De novo tendon/ligament-like tissue formation induced by a composition of the present 
invention contributes to the repair of congenital, trauma induced, or other tendon or 
ligament defects of other origin, and is also useful in cosmetic plastic surgery for 
attachment or repair of tendons or ligaments. The compositions of the present invention 
5 may provide environment to attract tendon- or ligament-forming cells, stimulate growth 
of tendon- or ligament-forming cells, induce differentiation of progenitors of tendon- or 
ligament- forming cells, or induce growth of tendon/ligament cells or progenitors ex vivo 
for return in vivo to effect tissue repair. The compositions of the invention may also be 
useful in the treatment of tendinitis, carpal tunnel syndrome and other tendon or ligament 

10 defects. The compositions may also include an appropriate matrix and/or sequestering 
agent as a carrier as is well known in the art. 

The compositions of the present invention may also be useful for proliferation of 
neural cells and for regeneration of nerve and brain tissue, i.e. for the treatment of central 
and peripheral nervous system diseases and neuropathies, as well as mechanical and 

15 traumatic disorders, which involve degeneration, death or trauma to neural cells or nerve 
tissue. More specifically, a composition may be used in the treatment of diseases of the 
peripheral nervous system, such as peripheral nerve injuries, peripheral neuropathy and 
localized neuropathies, and central nervous system diseases, such as Alzheimer's, 
Parkinson's disease, Huntington's disease, amyotrophic lateral sclerosis, and Shy-Drager 

20 syndrome. Further conditions which may be treated in accordance with the present 

invention include mechanical and traumatic disorders, such as spinal cord disorders, head 
trauma and cerebrovascular diseases such as stroke. Peripheral neuropathies resulting 
from chemotherapy or other medical therapies may also be treatable using a composition 
of the invention. 

25 Compositions of the invention may also be useful to promote better or faster 

closure of non-healing wounds, including without limitation pressure ulcers, ulcers 
associated with vascular insufficiency, surgical and traumatic wounds, and the like. 

Compositions of the present invention may also be involved in the generation or 
regeneration of other tissues, such as organs (including, for example, pancreas, liver, 

30 intestine, kidney, skin, endothelium), muscle (smooth, skeletal or cardiac) and vascular 
(including vascular endothelium) tissue, or for promoting the growth of cells comprising 
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such tissues. Part of the desired effects may be by inhibition or modulation of fibrotic 
scarring may allow normal tissue to regenerate. A polypeptide of the present invention 
may also exhibit angiogenic activity. 

A composition of the present invention may also be useful for gut protection or 
5 regeneration and treatment of lung or liver fibrosis, reperfusion injury in various tissues, 
and conditions resulting from systemic cytokine damage. 

A composition of the present invention may also be useful for promoting or 
inhibiting differentiation of tissues described above from precursor tissues or cells; or for 
inhibiting the growth of tissues described above. 
10 Therapeutic compositions of the invention can be used in the following: 

Assays for tissue generation activity include, without limitation, those described 
in: International Patent Publication No. WO95/16035 (bone, cartilage, tendon); 
International Patent Publication No. WO95/05846 (nerve, neuronal); International Patent 
Publication No. WO91/07491 (skin, endothelium). 
15 Assays for wound healing activity include, without limitation, those described in: 

Winter, Epidermal Wound Healing, pps. 71-112 (Maibach, H. I. and Rovee, D. T,, eds.), 
Year Book Medical Publishers, Inc., Chicago, as modified by Eaglstein and Mertz, J. 
Invest Dermatol 71:382-84 (1978). 



20 4.7.7 IMMUNE STIMULATING OR SUPPRESSING ACTIVITY 

A polypeptide of the present invention may also exhibit immune stimulating or 
immune suppressing activity, including without limitation the activities for which assays 
are described herein. A polynucleotide of the invention can encode a polypeptide 
exhibiting such activities. A protein may be useful in the treatment of various immune 

25 deficiencies and disorders (including severe combined immunodeficiency (SCID)), e.g., 
in regulating (up or down) growth and proliferation of T and/or B lymphocytes, as well as 
effecting the cytolytic activity of NK cells and other cell populations. These immune 
deficiencies may be genetic or be caused by viral (e.g., HIV) as well as bacterial or 
fungal infections, or may result from autoimmune disorders. More specifically, infectious 

30 diseases causes by viral, bacterial, fungal or other infection may be treatable using a 
protein of the present invention, including infections by HIV, hepatitis viruses, herpes 
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viruses, mycobacteria, Leishmania spp., malaria spp. and various fungal infections such 
as candidiasis. Of course, in this regard, proteins of the present invention may also be 
useful where a boost to the immune system generally may be desirable, i.e., in the 
treatment of cancer. 

5 Autoimmune disorders which may be treated using a protein of the present 

invention include, for example, connective tissue disease, multiple sclerosis, systemic 
lupus erythematosus, rheumatoid arthritis, autoimmune pulmonary inflammation, 
Guillain-Barre syndrome, autoimmune thyroiditis, insulin dependent diabetes mellitis, 
myasthenia gravis, graft-versus-host disease and autoimmune inflammatory eye disease. 

10 Such a protein (or antagonists thereof, including antibodies) of the present invention may 
also to be useful in the treatment of allergic reactions and conditions (e.g., anaphylaxis, 
serum sickness, drug reactions, food allergies, insect venom allergies, mastocytosis, 
allergic rhinitis, hypersensitivity pneumonitis, urticaria, angioedema, eczema, atopic 
dermatitis, allergic contact dermatitis, erythema multiforme, Stevens- Johnson syndrome, 

15 allergic conjunctivitis, atopic keratoconjunctivitis, venereal keratoconjunctivitis, giant 
papillary conjunctivitis and contact allergies), such as asthma (particularly allergic 
asthma) or other respiratory problems. Other conditions, in which immune suppression is 
desired (including, for example, organ transplantation), may also be treatable using a 
protein (or antagonists thereof) of the present invention. The therapeutic effects of the 

20 polypeptides or antagonists thereof on allergic reactions can be evaluated by in vivo 
animals models such as the cumulative contact enhancement test (Lastbom et al., 
Toxicology 125: 59-66, 1998), skin prick test (Hoffmann et al, Allergy 54: 446-54, 
1999), guinea pig skin sensitization test (Vohr et al, Arch. Toxocol. 73: 501-9), and 
murine local lymph node assay (Kimber et al., J. Toxicol. Environ. Health 53: 563-79). 

25 Using the proteins of the invention it may also be possible to modulate immune 

responses, in a number of ways. Down regulation may be in the form of inhibiting or 
blocking an immune response already in progress or may involve preventing the 
induction of an immune response. The functions of activated T cells may be inhibited by 
suppressing T cell responses or by inducing specific tolerance in T cells, or both. 

30 Immunosuppression of T cell responses is generally an active, non-antigen-specific, 
process which requires continuous exposure of the T cells to the suppressive agent. 
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Tolerance, which involves inducing non-responsiveness or anergy in T cells, is 
distinguishable from immunosuppression in that it is generally antigen-specific and 
persists after exposure to the tolerizing agent has ceased. Operationally, tolerance can be 
demonstrated by the lack of a T cell response upon reexposure to specific antigen in the 
5 absence of the tolerizing agent. 

Down regulating or preventing one or more antigen functions (including without 
limitation B lymphocyte antigen functions (such as, for example, B7)), e.g., preventing 
high level lymphokine synthesis by activated T cells, will be useful in situations of tissue, 
skin and organ transplantation and in graft-versus-host disease (GVHD). For example, 

10 blockage of T cell function should result in reduced tissue destruction in tissue 

transplantation. Typically, in tissue transplants, rejection of the transplant is initiated 
through its recognition as foreign by T cells, followed by an immune reaction that 
destroys the transplant. The administration of a therapeutic composition of the invention 
may prevent cytokine synthesis by immune cells, such as T cells, and thus acts as an 

1 5 immunosuppressant. Moreover, a lack of costimulation may also be sufficient to anergize 
the T cells, thereby inducing tolerance in a subject. Induction of long-term tolerance by B 
lymphocyte antigen-blocking reagents may avoid the necessity of repeated administration 
of these blocking reagents. To achieve sufficient immunosuppression or tolerance in a 
subject, it may also be necessary to block the function of a combination of B lymphocyte 

20 antigens. 

The efficacy of particular therapeutic compositions in preventing organ transplant 
rejection or GVHD can be assessed using animal models that are predictive of efficacy in 
humans. Examples of appropriate systems which can be used include allogeneic cardiac 
grafts in rats and xenogeneic pancreatic islet cell grafts in mice, both of which have been 

25 used to examine the immunosuppressive effects of CTLA4Ig fusion proteins in vivo as 
described in Lenschow et al., Science 257:789-792 (1992) and Turka et al., Proc. Natl. 
Acad. Sci USA, 89:11102-11105 (1992). In addition, murine models of GVHD (see Paul 
ed., Fundamental Immunology, Raven Press, New York, 1989, pp. 846-847) can be used 
to determine the effect of therapeutic compositions of the invention on the development 

30 of that disease. 
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Blocking antigen function may also be therapeutically useful for treating 
autoimmune diseases. Many autoimmune disorders are the result of inappropriate 
activation of T cells that are reactive against self tissue and which promote the production 
of cytokines and autoantibodies involved in the pathology of the diseases. Preventing the 
5 activation of autoreactive T cells may reduce or eliminate disease symptoms. 

Administration of reagents which block stimulation of T cells can be used to inhibit T 
cell activation and prevent production of autoantibodies or T cell-derived cytokines 
which may be involved in the disease process. Additionally, blocking reagents may 
induce antigen-specific tolerance of autoreactive T cells which could lead to long-term 

10 relief from the disease. The efficacy of blocking reagents in preventing or alleviating 
autoimmune disorders can be determined using a number of well-characterized animal 
models of human autoimmune diseases. Examples include murine experimental 
autoimmune encephalitis, systemic lupus erythmatosis in MRL/lpr/lpr mice or NZB 
hybrid mice, murine autoimmune collagen arthritis, diabetes mellitus in NOD mice and 

15 BB rats, and murine experimental myasthenia gravis (see Paul ed., Fundamental 
Immunology, Raven Press, New York, 1989, pp. 840-856). 

Upregulation of an antigen function (e.g., a B lymphocyte antigen function), as a 
means of up regulating immune responses, may also be useful in therapy. Upregulation of 
immune responses may be in the form of enhancing an existing immune response or 

20 eliciting an initial immune response. For example, enhancing an immune response may 
be useful in cases of viral infection, including systemic viral diseases such as influenza, 
the common cold, and encephalitis. 

Alternatively, anti-viral immune responses may be enhanced in an infected patient 
by removing T cells from the patient, costimulating the T cells in vitro with viral 

25 antigen-pulsed APCs either expressing a peptide of the present invention or together with 
a stimulatory form of a soluble peptide of the present invention and reintroducing the in 
vitro activated T cells into the patient. Another method of enhancing anti- viral immune 
responses would be to isolate infected cells from a patient, transfect them with a nucleic 
acid encoding a protein of the present invention as described herein such that the cells 

30 express all or a portion of the protein on their surface, and reintroduce the transfected 
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cells into the patient. The infected cells would now be capable of delivering a 
costimulatory signal to, and thereby activate, T cells in vivo. 

A polypeptide of the present invention may provide the necessary stimulation 
signal to T cells to induce a T cell mediated immune response against the transfected 

5 tumor cells. In addition, tumor cells which lack MHC class I or MHC class II molecules, 
or which fail to reexpress sufficient mounts of MHC class I or MHC class II molecules, 
can be transfected with nucleic acid encoding all or a portion of (e.g., a 
cytoplasmic-domain truncated portion) of an MHC class I alpha chain protein and p 2 
microglobulin protein or an MHC class II alpha chain protein and an MHC class II beta 

1 0 chain protein to thereby express MHC class I or MHC class II proteins on the cell 
surface. Expression of the appropriate class I or class II MHC in conjunction with a 
peptide having the activity of a B lymphocyte antigen (e.g., B7-1, B7-2, B7-3) induces a 
T cell mediated immune response against the transfected tumor cell. Optionally, a gene 
encoding an antisense construct which blocks expression of an MHC class II associated 

1 5 protein, such as the invariant chain, can also be cotransfected with a DNA encoding a 
peptide having the activity of a B lymphocyte antigen to promote presentation of tumor 
associated antigens and induce tumor specific immunity. Thus, the induction of a T cell 
mediated immune response in a human subject may be sufficient to overcome 
tumor-specific tolerance in the subject. 

20 The activity of a protein of the invention may, among other means, be measured 

by the following methods: 

Suitable assays for thymocyte or splenocyte cytotoxicity include, without 
limitation, those described in: Current Protocols in Immunology, Ed by J. E. Coligan, A. 
M. Kruisbeek, D. H. Margulies, E. M. Shevach, W. Strober, Pub. Greene Publishing 

25 Associates and Wiley-Interscience (Chapter 3, In Vitro assays for Mouse Lymphocyte 
Function 3.1-3.19; Chapter 7, Immunologic studies in Humans); Herrmann et al, Proc. 
Natl. Acad. Sci. USA 78:2488-2492, 1981; Herrmann et al., J. Immunol. 128:1968-1974, 
1982; Handa et al, J. Immunol. 135:1564-1572, 1985; Takai et al., I. Immunol. 
137:3494-3500, 1986; Takai et al., J. Immunol. 140:508-512, 1988; Bowman et al., J. 

30 Virology 61:1992-1998; Bertagnolli et al., Cellular Immunology 133:327-341, 1991; 
Brown et al., J. Immunol. 153:3079-3092, 1994. 
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Assays for T-cell-dependent immunoglobulin responses and isotype switching 
(which will identify, among others, proteins that modulate T-cell dependent antibody 
responses and that affect Thl/Th2 profiles) include, without limitation, those described 
in: Maliszewski, J. Immunol. 144:3028-3033, 1990; and Assays for B cell function: In 
5 vitro antibody production, Mond, J. J. and Brunswick, M. In Current Protocols in 
Immunology. J. E. e.a. Coligan eds. Vol 1 pp. 3.8.1-3.8.16, John Wiley and Sons, 
Toronto. 1994. 

Mixed lymphocyte reaction (MLR) assays (which will identify, among others, 
proteins that generate predominantly Thl and CTL responses) include, without limitation, 

1 0 those described in: Current Protocols in Immunology, Ed by J. E. Coligan, A. M. 
Kruisbeek, D. H. Margulies, E. M. Shevach, W. Strober, Pub. Greene Publishing 
Associates and Wiley-Interscience (Chapter 3, In Vitro assays for Mouse Lymphocyte 
Function 3.1-3.19; Chapter 7, Immunologic studies in Humans); Takai et al., J. Immunol. 
137:3494-3500, 1986; Takai et al., J. Immunol. 140:508-512, 1988; Bertagnolli et al., J. 

15 Immunol. 149:3778-3783, 1992. 

Dendritic cell-dependent assays (which will identify, among others, proteins 
expressed by dendritic cells that activate naive T-cells) include, without limitation, those 
described in: Guery et al., J. Immunol. 134:536-544, 1995; Inaba et al., Journal of 
Experimental Medicine 173:549-559, 1991; Macatonia et al., Journal of Immunology 

20 1 54:5071-5079, 1995; Porgador et al., Journal of Experimental Medicine 1 82:255-260, 
1995; Nair et al., Journal of Virology 67:4062-4069, 1993; Huang et al., Science 
264:961-965, 1994; Macatonia et al, Journal of Experimental Medicine 169:1255-1264, 
1989; Bhardwaj et al, Journal of Clinical Investigation 94:797-807, 1994; and Inaba et 
al, Journal of Experimental Medicine 172:631-640, 1990. 

25 Assays for lymphocyte survival/apoptosis (which will identify, among others, 

proteins that prevent apoptosis after superantigen induction and proteins that regulate 
lymphocyte homeostasis) include, without limitation, those described in: Darzynkiewicz 
et al., Cytometry 13:795-808, 1992; Gorczyca et al, Leukemia 7:659-670, 1993; 
Gorczyca et al., Cancer Research 53:1945-1951, 1993; Itoh et al., Cell 66:233-243, 1991; 

30 Zacharchuk, Journal of Immunology 145:4037-4045, 1990; Zamai et al., Cytometry 
14:891-897, 1993; Gorczyca et al., International Journal of Oncology 1:639-648, 1992. 
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Assays for proteins that influence early steps of T-cell commitment and 
development include, without limitation, those described in: Antica et al., Blood 
84:111-117, 1994; Fine et al, Cellular Immunology 155:111-122, 1994; Galy et al., 
Blood 85:2770-2778, 1995; Toki et al., Proc. Nat. Acad Sci. USA 88:7548-7551, 1991. 

5 

4.7.8 ACTIVIN/INHIBIN ACTIVITY 

A polypeptide of the present invention may also exhibit activin- or inhibin-related 
activities. A polynucleotide of the invention may encode a polypeptide exhibiting such 
characteristics. Inhibins are characterized by their ability to inhibit the release of follicle 

1 0 stimulating hormone (FSH), while activins and are characterized by their ability to 

stimulate the release of follicle stimulating hormone (FSH). Thus, a polypeptide of the 
present invention, alone or in heterodimers with a member of the inhibin family, may be 
useful as a contraceptive based on the ability of inhibins to decrease fertility in female 
mammals and decrease spermatogenesis in male mammals. Administration of sufficient 

1 5 amounts of other inhibins can induce infertility in these mammals. Alternatively, the 
polypeptide of the invention, as a homodimer or as a heterodimer with other protein 
subunits of the inhibin group, maybe useful as a fertility inducing therapeutic, based 
upon the ability of activin molecules in stimulating FSH release from cells of the anterior 
pituitary. See, for example, U.S. Pat. No. 4,798,885. A polypeptide of the invention may 

20 also be useful for advancement of the onset of fertility in sexually immature mammals, so 
as to increase the lifetime reproductive performance of domestic animals such as, but not 
limited to, cows, sheep and pigs. 

The activity of a polypeptide of the invention may, among other means, be 
measured by the following methods. 

25 Assays for activin/inhibin activity include, without limitation, those described in: 

Vale et al., Endocrinology 91 :562-572, 1972; Ling et al., Nature 321:779-782, 1986; Vale 
et al., Nature 321:776-779, 1986; Mason et al., Nature 318:659-663, 1985; Forage et al., 
Proc. Natl. Acad. Sci. USA 83:3091-3095, 1986. 

30 4.7.9 CHEMOTACTIC/CHEMOKINETIC ACTIVITY 
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A polypeptide of the present invention may be involved in chemotactic or 
chemokinetic activity for mammalian cells, including, for example, monocytes, 
fibroblasts, neutrophils, T-cells, mast cells, eosinophils, epithelial and/or endothelial 
cells. A polynucleotide of the invention can encode a polypeptide exhibiting such 
5 attributes. Chemotactic and chemokinetic receptor activation can be used to mobilize or 
attract a desired cell population to a desired site of action. Chemotactic or chemokinetic 
compositions (e.g. proteins, antibodies, binding partners, or modulators of the invention) 
provide particular advantages in treatment of wounds and other trauma to tissues, as well 
as in treatment of localized infections. For example, attraction of lymphocytes, 

1 0 monocytes or neutrophils to tumors or sites of infection may result in improved immune 
responses against the tumor or infecting agent. 

A protein or peptide has chemotactic activity for a particular cell population if it 
can stimulate, directly or indirectly, the directed orientation or movement of such cell 
population. Preferably, the protein or peptide has the ability to directly stimulate directed 

1 5 movement of cells. Whether a particular protein has chemotactic activity for a population 
of cells can be readily determined by employing such protein or peptide in any known 
assay for cell chemotaxis. 

Therapeutic compositions of the invention can be used in the following: 
Assays for chemotactic activity (which will identify proteins that induce or 

20 prevent chemotaxis) consist of assays that measure the ability of a protein to induce the 
migration of cells across a membrane as well as the ability of a protein to induce the 
adhesion of one cell population to another cell population. Suitable assays for movement 
and adhesion include, without limitation, those described in: Current Protocols in 
Immunology, Ed by J. E. Coligan, A. M. Kruisbeek, D. H. Marguiles, E. M. Shevach, W. 

25 Strober, Pub. Greene Publishing Associates and Wiley-Interscience (Chapter 6.12, 

Measurement of alpha and beta Chemokines 6.12.1-6.12.28; Taub et al. J. Clin. Invest. 
95:1370-1376, 1995; Lind et al. APMIS 103:140-146, 1995; Muller et al Eur. J. 
Immunol. 25:1744-1748; Gruber et al. J. of Immunol. 152:5860-5867, 1994; Johnston et 
al J. of Immunol. 153:1762-1768, 1994. 

30 

4.7.10 HEMOSTATIC AND THROMBOLYTIC ACTIVITY 
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A polypeptide of the invention may also be involved in hemostatis or 
thrombolysis or thrombosis. A polynucleotide of the invention can encode a polypeptide 
exhibiting such attributes. Compositions may be useful in treatment of various 
coagulation disorders (including hereditary disorders, such as hemophilias) or to enhance 

5 coagulation and other hemostatic events in treating wounds resulting from trauma, 

surgery or other causes. A composition of the invention may also be useful for dissolving 
or inhibiting formation of thromboses and for treatment and prevention of conditions 
resulting therefrom (such as, for example, infarction of cardiac and central nervous 
system vessels (e.g., stroke). 

10 Therapeutic compositions of the invention can be used in the following: 

Assay for hemostatic and thrombolytic activity include, without limitation, those 
described in: Linet et al., J. Clin. Pharmacol. 26:131-140, 1986; Burdick et al. ? 
Thrombosis Res. 45:413-419, 1987; Humphrey et al., Fibrinolysis 5:71-79 (1991); 
Schaub, Prostaglandins 35:467-474, 1988. 

15 

4.7.11 CANCER DIAGNOSIS AND THERAPY 

Polypeptides of the invention may be involved in cancer cell generation, 
proliferation or metastasis. Detection of the presence or amount of polynucleotides or 
polypeptides of the invention maybe useful for the diagnosis and/or prognosis of one or 

20 more types of cancer. For example, the presence or increased expression of a 

polynucleotide/polypeptide of the invention may indicate a hereditary risk of cancer, a 
precancerous condition, or an ongoing malignancy. Conversely, a defect in the gene or 
absence of the polypeptide may be associated with a cancer condition. Identification of 
single nucleotide polymorphisms associated with cancer or a predisposition to cancer 

25 may also be useful for diagnosis or prognosis. 

Cancer treatments promote tumor regression by inhibiting tumor cell 
proliferation, inhibiting angiogenesis (growth of new blood vessels that is necessary to 
support tumor growth) and/or prohibiting metastasis by reducing tumor cell motility or 
invasiveness. Therapeutic compositions of the invention may be effective in adult and 

30 pediatric oncology including in solid phase tumors/malignancies, locally advanced 

tumors, human soft tissue sarcomas, metastatic cancer, including lymphatic metastases, 
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blood cell malignancies including multiple myeloma, acute and chronic leukemias, and 
lymphomas, head and neck cancers including mouth cancer, larynx cancer and thyroid 
cancer, lung cancers including small cell carcinoma and non-small cell cancers, breast 
cancers including small cell carcinoma and ductal carcinoma, gastrointestinal cancers 

5 including esophageal cancer, stomach cancer, colon cancer, colorectal cancer and polyps 
associated with colorectal neoplasia, pancreatic cancers, liver cancer, urologic cancers 
including bladder cancer and prostate cancer, malignancies of the female genital tract 
including ovarian carcinoma, uterine (including endometrial) cancers, and solid tumor in 
the ovarian follicle, kidney cancers including renal cell carcinoma, brain cancers 

1 0 including intrinsic brain tumors, neuroblastoma, astrocytic brain tumors, gliomas, 
metastatic tumor cell invasion in the central nervous system, bone cancers including 
osteomas, skin cancers including malignant melanoma, tumor progression of human skin 
keratinocytes, squamous cell carcinoma, basal cell carcinoma, hemangiopericytoma and 
Karposi's sarcoma. 

1 5 Polypeptides, polynucleotides, or modulators of polypeptides of the invention 

(including inhibitors and stimulators of the biological activity of the polypeptide of the 
invention) may be administered to treat cancer. Therapeutic compositions can be 
administered in therapeutically effective dosages alone or in combination with adjuvant 
cancer therapy such as surgery, chemotherapy, radiotherapy, thermotherapy, and laser 

20 therapy, and may provide a beneficial effect, e.g. reducing tumor size, slowing rate of 
tumor growth, inhibiting metastasis, or otherwise improving overall clinical condition, 
without necessarily eradicating the cancer. 

The composition can also be administered in therapeutically effective amounts as 
a portion of an anti-cancer cocktail. An anti-cancer cocktail is a mixture of the 

25 polypeptide or modulator of the invention with one or more anti-cancer drugs in addition 
to a pharmaceutical^ acceptable carrier for delivery. The use of anti-cancer cocktails as 
a cancer treatment is routine. Anti-cancer drugs that are well known in the art and can be 
used as a treatment in combination with the polypeptide or modulator of the invention 
include: Actinomycin D, Aminoglutethimide, Asparaginase, Bleomycin, Busulfan, 

30 Carboplatin, Carmustine, Chlorambucil, Cisplatin (cis-DDP), Cyclophosphamide, 

Cytarabine HC1 (Cytosine arabinoside), Dacarbazine, Dactinomycin, Daunorubicin HC1, 
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Doxorubicin HC1, Estramustine phosphate sodium, Etoposide (VI 6-2 13), Floxuridine, 5- 
Fluorouracil (5-Fu), Flutamide, Hydroxyurea (hydroxycarbamide), Ifosfamide, Interferon 
Alpha-2a, Interferon Alpha-2b, Leuprolide acetate (LHRH-releasing factor analog), 
Lomustine, Mechlorethamine HC1 (nitrogen mustard), Melphalan, Mercaptopurine, 
5 Mesna, Methotrexate (MTX), Mitomycin, Mitoxantrone HC1, Octreotide, Plicamycin, 
Procarbazine HC1, Streptozocin, Tamoxifen citrate, Thioguanine, Thiotepa, Vinblastine 
sulfate, Vincristine sulfate, Amsacrine, Azacitidine, Hexamethylmelamine, Interleukin-2, 
Mitoguazone, Pentostatin, Semustine, Teniposide, and Vindesine sulfate. 

In addition, therapeutic compositions of the invention may be used for 

10 prophylactic treatment of cancer. There are hereditary conditions and/or environmental 
situations (e.g. exposure to carcinogens) known in the art that predispose an individual to 
developing cancers. Under these circumstances, it may be beneficial to treat these 
individuals with therapeutically effective doses of the polypeptide of the invention to 
reduce the risk of developing cancers. 

15 In vitro models can be used to determine the effective doses of the polypeptide of 

the invention as a potential cancer treatment. These in vitro models include proliferation 
assays of cultured tumor cells, growth of cultured tumor cells in soft agar (see Freshney, 
(1987) Culture of Animal Cells: A Manual of Basic Technique, Wily-Liss, New York, 
NY Ch 18 and Ch 21), tumor systems in nude mice as described in Giovanella et al., J. 

20 Natl. Can. Inst., 52: 921-30 (1974), mobility and invasive potential of tumor cells in 
Boyden Chamber assays as described in Pilkington et al., Anticancer Res., 17: 4107-9 
(1997), and angiogenesis assays such as induction of vascularization of the chick 
chorioallantoic membrane or induction of vascular endothelial cell migration as described 
in Ribatta et al., Intl. J. Dev. Biol., 40: 1 189-97 (1999) and Li et al., Clin. Exp. 

25 Metastasis, 17:423-9 (1999), respectively. Suitable tumor cells lines are available, e.g. 
from American Type Tissue Culture Collection catalogs. 

4.7.12 RECEPTOR/LIGAND ACTIVITY 

A polypeptide of the present invention may also demonstrate activity as receptor, 
30 receptor ligand or inhibitor or agonist of receptor/ligand interactions. A polynucleotide 
of the invention can encode a polypeptide exhibiting such characteristics. Examples of 
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such receptors and ligands include, without limitation, cytokine receptors and their 
ligands, receptor kinases and their ligands, receptor phosphatases and their ligands, 
receptors involved in cell-cell interactions and their ligands (including without limitation, 
cellular adhesion molecules (such as selectins, integrins and their ligands) and 
5 receptor/ligand pairs involved in antigen presentation, antigen recognition and 

development of cellular and humoral immune responses. Receptors and ligands are also 
useful for screening of potential peptide or small molecule inhibitors of the relevant 
receptor/ligand interaction. A protein of the present invention (including, without 
limitation, fragments of receptors and ligands) may themselves be useful as inhibitors of 

1 0 receptor/ligand interactions. 

The activity of a polypeptide of the invention may, among other means, be 
measured by the following methods: 

Suitable assays for receptor-ligand activity include without limitation those 
described in: Current Protocols in Immunology, Ed by J. E. Coligan, A. M. Kruisbeek, 

15 D. H. Margulies, E. M. Shevach, W. Strober, Pub. Greene Publishing Associates and 
Wiley- Interscience (Chapter 7.28, Measurement of Cellular Adhesion under static 
conditions 7.28.1- 7.28.22), Takai et al., Proc. Natl. Acad. Sci. USA 84:6864-6868, 1987; 
Bierer et al., J. Exp. Med. 168:1145-1156, 1988; Rosenstein et al., J. Exp. Med. 
169:149-160 1989; Stoltenborg et al, J. Immunol. Methods 175:59-68, 1994; Stitt et al, 

20 Cell 80:661-670, 1995. 

By way of example, the polypeptides of the invention may be used as a receptor 
for a ligand(s) thereby transmitting the biological activity of that ligand(s). Ligands may 
be identified through binding assays, affinity chromatography, dihybrid screening assays, 
BIAcore assays, gel overlay assays, or other methods known in the art. 

25 Studies characterizing drugs or proteins as agonist or antagonist or partial agonists 

or a partial antagonist require the use of other proteins as competing ligands. The 
polypeptides of the present invention or ligand(s) thereof may be labeled by being 
coupled to radioisotopes, colorimetric molecules or a toxin molecules by conventional 
methods. ("Guide to Protein Purification" Murray P. Deutscher (ed) Methods in 

30 Enzymology Vol. 182 (1990) Academic Press, Inc. San Diego). Examples of 

radioisotopes include, but are not limited to, tritium and carbon- 14 . Examples of 



789CIP2C 



60 



colorimetric molecules include, but are not limited to, fluorescent molecules such as 
fluorescamine, or rhodamine or other colorimetric molecules. Examples of toxins 
include, but are not limited, to ricin. 

5 4.7.13 DRUG SCREENING 

This invention is particularly useful for screening chemical compounds by using 
the novel polypeptides or binding fragments thereof in any of a variety of drug screening 
techniques. The polypeptides or fragments employed in such a test may either be free in 
solution, affixed to a solid support, borne on a cell surface or located intracellularly. One 

1 0 method of drug screening utilizes eukaryotic or prokaryotic host cells which are stably 
transformed with recombinant nucleic acids expressing the polypeptide or a fragment 
thereof. Drugs are screened against such transformed cells in competitive binding assays. 
Such cells, either in viable or fixed form, can be used for standard binding assays. One 
may measure, for example, the formation of complexes between polypeptides of the 

1 5 invention or fragments and the agent being tested or examine the diminution in complex 
formation between the novel polypeptides and an appropriate cell line, which are well 
known in the art. 

Sources for test compounds that may be screened for ability to bind to or 
modulate (i.e., increase or decrease) the activity of polypeptides of the invention include 
20 (1) inorganic and organic chemical libraries, (2) natural product libraries, and (3) 

combinatorial libraries comprised of either random or mimetic peptides, oligonucleotides 
or organic molecules. 

Chemical libraries may be readily synthesized or purchased from a number of 
commercial sources, and may include structural analogs of known compounds or 
25 compounds that are identified as "hits" or "leads" via natural product screening. 

The sources of natural product libraries are microorganisms (including bacteria 
and fungi), animals, plants or other vegetation, or marine organisms, and libraries of 
mixtures for screening may be created by; (1) fermentation and extraction of broths from 
soil, plant or marine microorganisms or (2) extraction of the organisms themselves. 
30 Natural product libraries include polyketides, non-ribosomal peptides, and (non-naturally 
occurring) variants thereof. For a review, see Science 282:63-6% (1998). 
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Combinatorial libraries are composed of large numbers of peptides, 
oligonucleotides or organic compounds and can be readily prepared by traditional 
automated synthesis methods, PCR, cloning or proprietary synthetic methods. Of 
particular interest are peptide and oligonucleotide combinatorial libraries. Still other 
libraries of interest include peptide, protein, peptidomimetic, multiparallel synthetic 
collection, recombinatorial, and polypeptide libraries. For a review of combinatorial 
chemistry and libraries created therefrom, see Myers, Curr. Opin. Biotechnol. 8:701-707 
(1997). For reviews and examples of peptidomimetic libraries, see Al-Obeidi et al., Mol. 
Biotechnol, 9(3):205-23 (1998); Hruby et al., Curr Opin Chem Biol, 1(1):1 14-19 (1997); 
Dorner et al., BioorgMed Chem, 4(5):709-15 (1996) (alkylated dipeptides). 

Identification of modulators through use of the various libraries described herein 
permits modification of the candidate "hit" (or "lead") to optimize the capacity of the 
"hit" to bind a polypeptide of the invention. The molecules identified in the binding assay 
are then tested for antagonist or agonist activity in in vivo tissue culture or animal models 
that are well known in the art. In brief, the molecules are titrated into a plurality of cell 
cultures or animals and then tested for either cell/animal death or prolonged survival of 
the animal/cells. 

The binding molecules thus identified may be complexed with toxins, e.g., ricin 
or cholera, or with other compounds that are toxic to cells such as radioisotopes. The 
toxin-binding molecule complex is then targeted to a tumor or other cell by the specificity 
of the binding molecule for a polypeptide of the invention. Alternatively, the binding 
molecules may be complexed with imaging agents for targeting and imaging purposes. 

4.7.14 ASSAY FOR RECEPTOR ACTIVITY 

The invention also provides methods to detect specific binding of a polypeptide 
e.g. a ligand or a receptor. The art provides numerous assays particularly useful for 
identifying previously unknown binding partners for receptor polypeptides of the 
invention. For example, expression cloning using mammalian or bacterial cells, or 
dihybrid screening assays can be used to identify polynucleotides encoding binding 
partners. As another example, affinity chromatography with the appropriate immobilized 
polypeptide of the invention can be used to isolate polypeptides that recognize and bind 
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polypeptides of the invention. There are a number of different libraries used for the 
identification of compounds, and in particular small molecules, that modulate (i.e., 
increase or decrease) biological activity of a polypeptide of the invention. Ligands for 
receptor polypeptides of the invention can also be identified by adding exogenous 

5 ligands, or cocktails of ligands to two cells populations that are genetically identical 
except for the expression of the receptor of the invention: one cell population expresses 
the receptor of the invention whereas the other does not. The response of the two cell 
populations to the addition of ligands(s) are then compared. Alternatively, an expression 
library can be co-expressed with the polypeptide of the invention in cells and assayed for 

10 an autocrine response to identify potential ligand(s). As still another example, BIAcore 
assays, gel overlay assays, or other methods known in the art can be used to identify 
binding partner polypeptides, including, (1) organic and inorganic chemical libraries, (2) 
natural product libraries, and (3) combinatorial libraries comprised of random peptides, 
oligonucleotides or organic molecules. 

1 5 The role of downstream intracellular signaling molecules in the signaling cascade 

of the polypeptide of the invention can be determined. For example, a chimeric protein in 
which the cytoplasmic domain of the polypeptide of the invention is fused to the 
extracellular portion of a protein, whose ligand has been identified, is produced in a host 
cell. The cell is then incubated with the ligand specific for the extracellular portion of the 

20 chimeric protein, thereby activating the chimeric receptor. Known downstream proteins 
involved in intracellular signaling can then be assayed for expected modifications i.e. 
phosphorylation. Other methods known to those in the art can also be used to identify 
signaling molecules involved in receptor activity. 

25 4.7.15 ANTI-INFLAMMATORY ACTIVITY 

Compositions of the present invention may also exhibit anti-inflammatory 
activity. The anti-inflammatory activity may be achieved by providing a stimulus to cells 
involved in the inflammatory response, by inhibiting or promoting cell-cell interactions 
(such as, for example, cell adhesion), by inhibiting or promoting chemotaxis of cells 
30 involved in the inflammatory process, inhibiting or promoting cell extravasation, or by 
stimulating or suppressing production of other factors which more directly inhibit or 
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promote an inflammatory response. Compositions with such activities can be used to treat 
inflammatory conditions including chronic or acute conditions), including without 
limitation intimation associated with infection (such as septic shock, sepsis or systemic 
inflammatory response syndrome (SIRS)), ischemia-reperfusion injury, endotoxin 

5 lethality, arthritis, complement-mediated hyperacute rejection, nephritis, cytokine or 
chemokine-induced lung injury, inflammatory bowel disease, Crohn's disease or resulting 
from over production of cytokines such as TNF or IL-1. Compositions of the invention 
may also be useful to treat anaphylaxis and hypersensitivity to an antigenic substance or 
material. Compositions of this invention may be utilized to prevent or treat conditions 

10 such as, but not limited to, sepsis, acute pancreatitis, endotoxin shock, cytokine induced 
shock, rheumatoid arthritis, chronic inflammatory arthritis, pancreatic cell damage from 
diabetes mellitus type 1, graft versus host disease, inflammatory bowel disease, 
inflamation associated with pulmonary disease, other autoimmune disease or 
inflammatory disease, an antiproliferative agent such as for acute or chronic mylegenous 

1 5 leukemia or in the prevention of premature labor secondary to intrauterine infections. 

4.7.16 LEUKEMIAS 

Leukemias and related disorders may be treated or prevented by administration of 
a therapeutic that promotes or inhibits function of the polynucleotides and/or 

20 polypeptides of the invention. Such leukemias and related disorders include but are not 
limited to acute leukemia, acute lymphocytic leukemia, acute myelocytic leukemia, 
myeloblastic, promyelocytic, myelomonocytic, monocytic, erythroleukemia, chronic 
leukemia, chronic myelocytic (granulocytic) leukemia and chronic lymphocytic leukemia 
(for a review of such disorders, see Fishman et al., 1985, Medicine, 2d Ed., J.B. 

25 Lippincott Co., Philadelphia). 

4.7.17 NERVOUS SYSTEM DISORDERS 

Nervous system disorders, involving cell types which can be tested for efficacy of 
intervention with compounds that modulate the activity of the polynucleotides and/or 
30 polypeptides of the invention, and which can be treated upon thus observing an indication 
of therapeutic utility, include but are not limited to nervous system injuries, and diseases 
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or disorders which result in either a disconnection of axons, a diminution or degeneration 
of neurons, or demyelination. Nervous system lesions which may be treated in a patient 
(including human and non-human mammalian patients) according to the invention 
include but are not limited to the following lesions of either the central (including spinal 
cord, brain) or peripheral nervous systems: 

(i) traumatic lesions, including lesions caused by physical injury or associated 
with surgery, for example, lesions which sever a portion of the nervous system, or 
compression injuries; 

(ii) ischemic lesions, in which a lack of oxygen in a portion of the nervous 
system results in neuronal injury or death, including cerebral infarction or ischemia, or 
spinal cord infarction or ischemia; 

(iii) infectious lesions, in which a portion of the nervous system is destroyed or 
injured as a result of infection, for example, by an abscess or associated with infection by 
human immunodeficiency virus, herpes zoster, or herpes simplex virus or with Lyme 
disease, tuberculosis, syphilis; 

(iv) degenerative lesions, in which a portion of the nervous system is destroyed 
or injured as a result of a degenerative process including but not limited to degeneration 
associated with Parkinson's disease, Alzheimer's disease, Huntington's chorea, or 
amyotrophic lateral sclerosis; 

(v) lesions associated with nutritional diseases or disorders, in which a portion 
of the nervous system is destroyed or injured by a nutritional disorder or disorder of 
metabolism including but not limited to, vitamin B12 deficiency, folic acid deficiency, 
Wernicke disease, tobacco-alcohol amblyopia, Marchiafava-Bignami disease (primary 
degeneration of the corpus callosum), and alcoholic cerebellar degeneration; 

(vi) neurological lesions associated with systemic diseases including but not 
limited to diabetes (diabetic neuropathy, Bell's palsy), systemic lupus erythematosus, 
carcinoma, or sarcoidosis; 

(vii) lesions caused by toxic substances including alcohol, lead, or particular 

neurotoxins; and 

(viii) demyelinated lesions in which a portion of the nervous system is 
destroyed or injured by a demyelinating disease including but not limited to multiple 
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sclerosis, human immunodeficiency virus-associated myelopathy, transverse myelopathy 
or various etiologies, progressive multifocal leukoencephalopathy, and central pontine 
myelinolysis. 

Therapeutics which are useful according to the invention for treatment of a 
5 nervous system disorder may be selected by testing for biological activity in promoting 
the survival or differentiation of neurons. For example, and not by way of limitation, 
therapeutics which elicit any of the following effects may be useful according to the 
invention: 

(i) increased survival time of neurons in culture; 
1 0 (ii) increased sprouting of neurons in culture or in vivo; 

(iii) increased production of a neuron-associated molecule in culture or in vivo, 
e.g., choline acetyltransferase or acetylcholinesterase with respect to motor neurons; or 

(iv) decreased symptoms of neuron dysfunction in vivo. 

Such effects may be measured by any method known in the art. In preferred, 
1 5 non-limiting embodiments, increased survival of neurons may be measured by the 

method set forth in Arakawa et al. (1990, J. Neurosci. 10:3507-3515); increased sprouting 
of neurons may be detected by methods set forth in Pestronk et al. (1980, Exp. Neurol. 
70:65-82) or Brown et al (1981, Ann. Rev. Neurosci. 4:17-42); increased production of 
neuron-associated molecules may be measured by bioassay, enzymatic assay, antibody 
20 binding, Northern blot assay, etc., depending on the molecule to be measured; and motor 
neuron dysfunction may be measured by assessing the physical manifestation of motor 
neuron disorder, e.g., weakness, motor neuron conduction velocity, or functional 
disability. 

In specific embodiments, motor neuron disorders that may be treated according to 
25 the invention include but are not limited to disorders such as infarction, infection, 

exposure to toxin, trauma, surgical damage, degenerative disease or malignancy that may 
affect motor neurons as well as other components of the nervous system, as well as 
disorders that selectively affect neurons such as amyotrophic lateral sclerosis, and 
including but not limited to progressive spinal muscular atrophy, progressive bulbar 
30 palsy, primary lateral sclerosis, infantile and juvenile muscular atrophy, progressive 
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bulbar paralysis of childhood (Fazio-Londe syndrome), poliomyelitis and the post polio 
syndrome, and Hereditary Motorsensory Neuropathy (Charcot-Marie-Tooth Disease). 

4.7.18 OTHER ACTIVITIES 

5 A polypeptide of the invention may also exhibit one or more of the following 

additional activities or effects: inhibiting the growth, infection or function of, or killing, 
infectious agents, including, without limitation, bacteria, viruses, fungi and other 
parasites; effecting (suppressing or enhancing) bodily characteristics, including, without 
limitation, height, weight, hair color, eye color, skin, fat to lean ratio or other tissue 

10 pigmentation, or organ or body part size or shape (such as, for example, breast 

augmentation or diminution, change in bone form or shape); effecting biorhythms or 
circadian cycles or rhythms; effecting the fertility of male or female subjects; effecting 
the metabolism, catabolism, anabolism, processing, utilization, storage or elimination of 
dietary fat, lipid, protein, carbohydrate, vitamins, minerals, co-factors or other nutritional 

15 factors or component(s); effecting behavioral characteristics, including, without 

limitation, appetite, libido, stress, cognition (including cognitive disorders), depression 
(including depressive disorders) and violent behaviors; providing analgesic effects or 
other pain reducing effects; promoting differentiation and growth of embryonic stem cells 
in lineages other than hematopoietic lineages; hormonal or endocrine activity; in the case 

20 of enzymes, correcting deficiencies of the enzyme and treating deficiency-related 
diseases; treatment of hyperproliferative disorders (such as, for example, psoriasis); 
immunoglobulin-like activity (such as, for example, the ability to bind antigens or 
complement); and the ability to act as an antigen in a vaccine composition to raise an 
immune response against such protein or another material or entity which is 

25 cross-reactive with such protein. 

4.7.19 IDENTIFICATION OF POLYMORPHISMS 

The demonstration of polymorphisms makes possible the identification of such 
polymorphisms in human subjects and the pharmacogenetic use of this information for 
30 diagnosis and treatment. Such polymorphisms may be associated with, e.g., differential 
predisposition or susceptibility to various disease states (such as disorders involving 
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inflammation or immune response) or a differential response to drug administration, and 
this genetic information can be used to tailor preventive or therapeutic treatment 
appropriately. For example, the existence of a polymorphism associated with a 
predisposition to inflammation or autoimmune disease makes possible the diagnosis of 

5 this condition in humans by identifying the presence of the polymorphism. 

Polymorphisms can be identified in a variety of ways known in the art which all 
generally involve obtaining a sample from a patient, analyzing DNA from the sample, 
optionally involving isolation or amplification of the DNA, and identifying the presence 
of the polymorphism in the DNA. For example, PCR may be used to amplify an 

1 0 appropriate fragment of genomic DNA which may then be sequenced. Alternatively, the 
DNA may be subjected to allele-specific oligonucleotide hybridization (in which 
appropriate oligonucleotides are hybridized to the DNA under conditions permitting 
detection of a single base mismatch) or to a single nucleotide extension assay (in which 
an oligonucleotide that hybridizes immediately adjacent to the position of the 

1 5 polymorphism is extended with one or more labeled nucleotides). In addition, traditional 
restriction fragment length polymorphism analysis (using restriction enzymes that 
provide differential digestion of the genomic DNA depending on the presence or absence 
of the polymorphism) may be performed. Arrays with nucleotide sequences of the 
present invention can be used to detect polymorphisms. The array can comprise modified 

20 nucleotide sequences of the present invention in order to detect the nucleotide sequences 
of the present invention. In the alternative, any one of the nucleotide sequences of the 
present invention can be placed on the array to detect changes from those sequences. 

Alternatively a polymorphism resulting in a change in the amino acid sequence 
could also be detected by detecting a corresponding change in amino acid sequence of the 

25 protein, e.g., by an antibody specific to the variant sequence. 

4.7.20 ARTHRITIS AND INFLAMMATION 

The immunosuppressive effects of the compositions of the invention against 
rheumatoid arthritis is determined in an experimental animal model system. The 
30 experimental model system is adjuvant induced arthritis in rats, and the protocol is 

described by J. Holoshitz, et at., 1983, Science, 219:56, or by B. Waksman et al., 1963, 
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Int. Arch. Allergy Appl. Immunol., 23:129. Induction of the disease can be caused by a 
single injection, generally intradermally, of a suspension of killed Mycobacterium 
tuberculosis in complete Freund's adjuvant (CFA). The route of injection can vary, but 
rats may be injected at the base of the tail with an adjuvant mixture. The polypeptide is 
5 administered in phosphate buffered solution (PBS) at a dose of about 1-5 mg/kg. The 
control consists of administering PBS only. 

The procedure for testing the effects of the test compound would consist of 
intradermally injecting killed Mycobacterium tuberculosis in CFA followed by 
immediately administering the test compound and subsequent treatment every other day 
10 until day 24. At 14, 15, 18, 20, 22, and 24 days after injection of Mycobacterium CFA, an 
overall arthritis score may be obtained as described by J. Holoskitz above. An analysis of 
the data would reveal that the test compound would have a dramatic affect on the 
swelling of the joints as measured by a decrease of the arthritis score. 

15 

4,8 THERAPEUTIC METHODS 

The compositions (including polypeptide fragments, analogs, variants and 
antibodies or other binding partners or modulators including antisense polynucleotides) 
of the invention have numerous applications in a variety of therapeutic methods. 
20 Examples of therapeutic applications include, but are not limited to, those exemplified 
herein. 

4.8.1 EXAMPLE 

One embodiment of the invention is the administration of an effective amount of 
25 the polypeptides or other composition of the invention to individuals affected by a 

disease or disorder that can be modulated by regulating the peptides of the invention. 

While the mode of administration is not particularly important, parenteral administration 

is preferred. An exemplary mode of administration is to deliver an intravenous bolus. 

The dosage of the polypeptides or other composition of the invention will normally be 
30 determined by the prescribing physician. It is to be expected that the dosage will vary 

according to the age, weight, condition and response of the individual patient. Typically, 
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the amount of polypeptide administered per dose will be in the range of about 0.01(ag/kg 
to 100 mg/kg of body weight, with the preferred dose being about 0.1p,g/kg to 10 mg/kg 
of patient body weight. For parenteral administration, polypeptides of the invention will 
be formulated in an injectable form combined with a pharmaceutical^ acceptable 
5 parenteral vehicle. Such vehicles are well known in the art and examples include water, 
saline, Ringer's solution, dextrose solution, and solutions consisting of small amounts of 
the human serum albumin. The vehicle may contain minor amounts of additives that 
maintain the isotonicity and stability of the polypeptide or other active ingredient. The 
preparation of such solutions is within the skill of the art. 

10 

4.9 PHARMACEUTICAL FORMULATIONS AND ROUTES OF 
ADMINISTRATION 

A protein or other composition of the present invention (from whatever source 

1 5 derived, including without limitation from recombinant and non-recombinant sources and 
including antibodies and other binding partners of the polypeptides of the invention) may 
be administered to a patient in need, by itself, or in pharmaceutical compositions where it 
is mixed with suitable carriers or excipient(s) at doses to treat or ameliorate a variety of 
disorders. Such a composition may optionally contain (in addition to protein or other 

20 active ingredient and a carrier) diluents, fillers, salts, buffers, stabilizers, solubilizers, and 
other materials well known in the art. The term "pharmaceutical^ acceptable" means a 
non-toxic material that does not interfere with the effectiveness of the biological activity 
of the active ingredient(s). The characteristics of the carrier will depend on the route of 
administration. The pharmaceutical composition of the invention may also contain 

25 cytokines, lymphokines, or other hematopoietic factors such as M-CSF, GM-CSF, TNF, 
IL-1, IL-2, IL-3, IL-4, IL-5, IL-6, IL-7, IL-8, IL-9, IL-10, IL-11, IL-12, IL-13, IL-14, 
IL-15, EFN, TNF0, TNF1, TNF2, G-CSF, Meg-CSF, thrombopoietin, stem cell factor, 
and erythropoietin. In further compositions, proteins of the invention may be combined 
with other agents beneficial to the treatment of the disease or disorder in question. These 

30 agents include various growth factors such as epidermal growth factor (EGF), 
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platelet-derived growth factor (PDGF), transforming growth factors (TGF-oc and TGF-P), 
insulin-like growth factor (IGF), as well as cytokines described herein. 

The pharmaceutical composition may further contain other agents which either 
enhance the activity of the protein or other active ingredient or complement its activity or 
use in treatment. Such additional factors and/or agents may be included in the 
pharmaceutical composition to produce a synergistic effect with protein or other active 
ingredient of the invention, or to minimize side effects. Conversely, protein or other 
active ingredient of the present invention may be included in formulations of the 
particular clotting factor, cytokine, lymphokine, other hematopoietic factor, thrombolytic 
or anti-thrombotic factor, or anti- inflammatory agent to minimize side effects of the 
clotting factor, cytokine, lymphokine, other hematopoietic factor, thrombolytic or 
anti-thrombotic factor, or anti-inflammatory agent (such as IL-IRa, IL-1 Hyl, IL-1 Hy2, 
anti-TNF, corticosteroids, immunosuppressive agents). A protein of the present 
invention may be active in multimers (e.g., heterodimers or homodimers) or complexes 
with itself or other proteins. As a result, pharmaceutical compositions of the invention 
may comprise a protein of the invention in such multimeric or complexed form. 

As an alternative to being included in a pharmaceutical composition of the 
invention including a first protein, a second protein or a therapeutic agent may be 
concurrently administered with the first protein (e.g., at the same time, or at differing 
times provided that therapeutic concentrations of the combination of agents is achieved at 
the treatment site). Techniques for formulation and administration of the compounds of 
the instant application may be found in "Remington's Pharmaceutical Sciences," Mack 
Publishing Co., Easton, PA, latest edition. A therapeutically effective dose further refers 
to that amount of the compound sufficient to result in amelioration of symptoms, e.g., 
treatment, healing, prevention or amelioration of the relevant medical condition, or an 
increase in rate of treatment, healing, prevention or amelioration of such conditions. 
When applied to an individual active ingredient, administered alone, a therapeutically 
effective dose refers to that ingredient alone. When applied to a combination, a 
therapeutically effective dose refers to combined amounts of the active ingredients that 
result in the therapeutic effect, whether administered in combination, serially or 
simultaneously. 
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In practicing the method of treatment or use of the present invention, a 
therapeutically effective amount of protein or other active ingredient of the present 
invention is administered to a mammal having a condition to be treated. Protein or other 
active ingredient of the present invention may be administered in accordance with the 
method of the invention either alone or in combination with other therapies such as 
treatments employing cytokines, lymphokines or other hematopoietic factors. When co- 
administered with one or more cytokines, lymphokines or other hematopoietic factors, 
protein or other active ingredient of the present invention may be administered either 
simultaneously with the cytokine(s), lymphokine(s), other hematopoietic factor(s), 
thrombolytic or anti-thrombotic factors, or sequentially. If administered sequentially, the 
attending physician will decide on the appropriate sequence of administering protein or 
other active ingredient of the present invention in combination with cytokine(s), 
lymphokine(s), other hematopoietic factor(s), thrombolytic or anti-thrombotic factors. 

4.9.1 ROUTES OF ADMINISTRATION 

Suitable routes of administration may, for example, include oral, rectal, 
transmucosal, or intestinal administration; parenteral delivery, including intramuscular, 
subcutaneous, intramedullary injections, as well as intrathecal, direct intraventricular, 
intravenous, intraperitoneal, intranasal, or intraocular injections. Administration of 
protein or other active ingredient of the present invention used in the pharmaceutical 
composition or to practice the method of the present invention can be carried out in a 
variety of conventional ways, such as oral ingestion, inhalation, topical application or 
cutaneous, subcutaneous, intraperitoneal, parenteral or intravenous injection. Intravenous 
administration to the patient is preferred. 

Alternately, one may administer the compound in a local rather than systemic 
manner, for example, via injection of the compound directly into a arthritic joints or in 
fibrotic tissue, often in a depot or sustained release formulation. In order to prevent the 
scarring process frequently occurring as complication of glaucoma surgery, the 
compounds may be administered topically, for example, as eye drops. Furthermore, one 
may administer the drug in a targeted drug delivery system, for example, in a liposome 
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coated with a specific antibody, targeting, for example, arthritic or fibrotic tissue. The 
liposomes will be targeted to and taken up selectively by the afflicted tissue. 

The polypeptides of the invention are administered by any route that delivers an 
effective dosage to the desired site of action. The determination of a suitable route of 

5 administration and an effective dosage for a particular indication is within the level of 
skill in the art. Preferably for wound treatment, one administers the therapeutic 
compound directly to the site. Suitable dosage ranges for the polypeptides of the 
invention can be extrapolated from these dosages or from similar studies in appropriate 
animal models. Dosages can then be adjusted as necessary by the clinician to provide 

1 0 maximal therapeutic benefit. 

4,9.2 COMPOSITIONS/FORMULATIONS 

Pharmaceutical compositions for use in accordance with the present invention 
thus may be formulated in a conventional manner using one or more physiologically 

1 5 acceptable carriers comprising excipients and auxiliaries which facilitate processing of 
the active compounds into preparations which can be used pharmaceutical^. These 
pharmaceutical compositions may be manufactured in a manner that is itself known, e.g., 
by means of conventional mixing, dissolving, granulating, dragee-making, levigating, 
emulsifying, encapsulating, entrapping or lyophilizing processes. Proper formulation is 

20 dependent upon the route of administration chosen. When a therapeutically effective 
amount of protein or other active ingredient of the present invention is administered 
orally, protein or other active ingredient of the present invention will be in the form of a 
tablet, capsule, powder, solution or elixir. When administered in tablet form, the 
pharmaceutical composition of the invention may additionally contain a solid carrier such 

25 as a gelatin or an adjuvant. The tablet, capsule, and powder contain from about 5 to 95% 
protein or other active ingredient of the present invention, and preferably from about 25 
to 90% protein or other active ingredient of the present invention. When administered in 
liquid form, a liquid carrier such as water, petroleum, oils of animal or plant origin such 
as peanut oil, mineral oil, soybean oil, or sesame oil, or synthetic oils may be added. The 

30 liquid form of the pharmaceutical composition may further contain physiological saline 
solution, dextrose or other saccharide solution, or glycols such as ethylene glycol, 
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propylene glycol or polyethylene glycol. When administered in liquid form, the 
pharmaceutical composition contains from about 0.5 to 90% by weight of protein or other 
active ingredient of the present invention, and preferably from about 1 to 50% protein or 
other active ingredient of the present invention. 
5 When a therapeutically effective amount of protein or other active ingredient of 

the present invention is administered by intravenous, cutaneous or subcutaneous 
injection, protein or other active ingredient of the present invention will be in the form of 
a pyrogen-free, parenterally acceptable aqueous solution. The preparation of such 
parenterally acceptable protein or other active ingredient solutions, having due regard to 
1 0 pH, isotonicity, stability, and the like, is within the skill in the art. A preferred 

pharmaceutical composition for intravenous, cutaneous, or subcutaneous injection should 
contain, in addition to protein or other active ingredient of the present invention, an 
isotonic vehicle such as Sodium Chloride Injection, Ringer's Injection, Dextrose 
Injection, Dextrose and Sodium Chloride Injection, Lactated Ringer's Injection, or other 
1 5 vehicle as known in the art. The pharmaceutical composition of the present invention 
may also contain stabilizers, preservatives, buffers, antioxidants, or other additives 
known to those of skill in the art. For injection, the agents of the invention may be 
formulated in aqueous solutions, preferably in physiologically compatible buffers such as 
Hanks's solution, Ringer's solution, or physiological saline buffer. For transmucosal 
20 administration, penetrants appropriate to the barrier to be permeated are used in the 
formulation. Such penetrants are generally known in the art. 

For oral administration, the compounds can be formulated readily by combining 
the active compounds with pharmaceutically acceptable carriers well known in the art. 
Such carriers enable the compounds of the invention to be formulated as tablets, pills, 
25 dragees, capsules, liquids, gels, syrups, slurries, suspensions and the like, for oral 
ingestion by a patient to be treated. Pharmaceutical preparations for oral use can be 
obtained from a solid excipient, optionally grinding a resulting mixture, and processing 
the mixture of granules, after adding suitable auxiliaries, if desired, to obtain tablets or 
dragee cores. Suitable excipients are, in particular, fillers such as sugars, including 
30 lactose, sucrose, mannitol, or sorbitol; cellulose preparations such as, for example, maize 
starch, wheat starch, rice starch, potato starch, gelatin, gum tragacanth, methyl cellulose, 



789CIP2C 



74 



hydroxypropylmethyl-cellulose, sodium carboxymethylcellulose, and/or 
polyvinylpyrrolidone (PVP). If desired, disintegrating agents maybe added, such as the 
cross-linked polyvinyl pyrrolidone, agar, or alginic acid or a salt thereof such as sodium 
alginate. Dragee cores are provided with suitable coatings. For this purpose, 

5 concentrated sugar solutions may be used, which may optionally contain gum arabic, talc, 
polyvinyl pyrrolidone, carbopol gel, polyethylene glycol, and/or titanium dioxide, lacquer 
solutions, and suitable organic solvents or solvent mixtures. Dyestuffs or pigments may 
be added to the tablets or dragee coatings for identification or to characterize different 
combinations of active compound doses. 

1 0 Pharmaceutical preparations which can be used orally include push-fit capsules 

made of gelatin, as well as soft, sealed capsules made of gelatin and a plasticizer, such as 
glycerol or sorbitol The push-fit capsules can contain the active ingredients in admixture 
with filler such as lactose, binders such as starches, and/or lubricants such as talc or 
magnesium stearate and, optionally, stabilizers. In soft capsules, the active compounds 

15 may be dissolved or suspended in suitable liquids, such as fatty oils, liquid paraffin, or 
liquid polyethylene glycols. In addition, stabilizers maybe added. All formulations for 
oral administration should be in dosages suitable for such administration. For buccal 
administration, the compositions may take the form of tablets or lozenges formulated in 
conventional manner. 

20 For administration by inhalation, the compounds for use according to the present 

invention are conveniently delivered in the form of an aerosol spray presentation from 
pressurized packs or a nebuliser, with the use of a suitable propellant, e.g., 
dichlorodifluoromethane, trichlorofluoromethane, dichlorotetrafluoroethane, carbon 
dioxide or other suitable gas. In the case of a pressurized aerosol the dosage unit may be 

25 determined by providing a valve to deliver a metered amount. Capsules and cartridges 
of, e.g., gelatin for use in an inhaler or insufflator may be formulated containing a powder 
mix of the compound and a suitable powder base such as lactose or starch. The 
compounds may be formulated for parenteral administration by injection, e.g., by bolus 
injection or continuous infusion. Formulations for injection may be presented in unit 

30 dosage form, e.g., in ampules or in multi-dose containers, with an added preservative. 
The compositions may take such forms as suspensions, solutions or emulsions in oily or 
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aqueous vehicles, and may contain formulatory agents such as suspending, stabilizing 
and/or dispersing agents. 

Pharmaceutical formulations for parenteral administration include aqueous 
solutions of the active compounds in water-soluble form. Additionally, suspensions of 

5 the active compounds may be prepared as appropriate oily injection suspensions. 

Suitable lipophilic solvents or vehicles include fatty oils such as sesame oil, or synthetic 
fatty acid esters, such as ethyl oleate or triglycerides, or liposomes. Aqueous injection 
suspensions may contain substances which increase the viscosity of the suspension, such 
as sodium carboxymethyl cellulose, sorbitol, or dextran. Optionally, the suspension may 

10 also contain suitable stabilizers or agents which increase the solubility of the compounds 
to allow for the preparation of highly concentrated solutions. Alternatively, the active 
ingredient may be in powder form for constitution with a suitable vehicle, e.g., sterile 
pyrogen- free water, before use. 

The compounds may also be formulated in rectal compositions such as 

1 5 suppositories or retention enemas, e.g. , containing conventional suppository bases such as 
cocoa butter or other glycerides. In addition to the formulations described previously, the 
compounds may also be formulated as a depot preparation. Such long acting 
formulations may be administered by implantation (for example subcutaneously or 
intramuscularly) or by intramuscular injection. Thus, for example, the compounds may 

20 be formulated with suitable polymeric or hydrophobic materials (for example as an 

emulsion in an acceptable oil) or ion exchange resins, or as sparingly soluble derivatives, 
for example, as a sparingly soluble salt. 

A pharmaceutical carrier for the hydrophobic compounds of the invention is a co- 
solvent system comprising benzyl alcohol, a nonpolar surfactant, a water-miscible 

25 organic polymer, and an aqueous phase. The co-solvent system may be the VPD 

co-solvent system. VPD is a solution of 3% w/v benzyl alcohol, 8% w/v of the nonpolar 
surfactant polysorbate 80, and 65% w/v polyethylene glycol 300, made up to volume in 
absolute ethanol. The VPD co-solvent system (VPD:5W) consists of VPD diluted 1:1 
with a 5% dextrose in water solution. This co-solvent system dissolves hydrophobic 

30 compounds well, and itself produces low toxicity upon systemic administration. 

Naturally, the proportions of a co-solvent system may be varied considerably without 
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destroying its solubility and toxicity characteristics. Furthermore, the identity of the 
co-solvent components may be varied: for example, other low-toxicity nonpolar 
surfactants maybe used instead of polysorbate 80; the fraction size of polyethylene 
glycol maybe varied; other biocompatible polymers may replace polyethylene glycol, 
5 e.g. polyvinyl pyrrolidone; and other sugars or polysaccharides may substitute for 
dextrose. Alternatively, other delivery systems for hydrophobic pharmaceutical 
compounds may be employed. Liposomes and emulsions are well known examples of 
delivery vehicles or carriers for hydrophobic drugs. Certain organic solvents such as 
dimethylsulfoxide also may be employed, although usually at the cost of greater toxicity. 

10 Additionally, the compounds may be delivered using a sustained-release system, such as 
semipermeable matrices of solid hydrophobic polymers containing the therapeutic agent. 
Various types of sustained-release materials have been established and are well known by 
those skilled in the art. Sustained-release capsules may, depending on their chemical 
nature, release the compounds for a few weeks up to over 100 days. Depending on the 

15 chemical nature and the biological stability of the therapeutic reagent, additional 
strategies for protein or other active ingredient stabilization may be employed. 

The pharmaceutical compositions also may comprise suitable solid or gel phase 
carriers or excipients. Examples of such carriers or excipients include but are not limited 
to calcium carbonate, calcium phosphate, various sugars, starches, cellulose derivatives, 

20 gelatin, and polymers such as polyethylene glycols. Many of the active ingredients of the 
invention may be provided as salts with pharmaceutical^ compatible counter ions. Such 
pharmaceutically acceptable base addition salts are those salts which retain the biological 
effectiveness and properties of the free acids and which are obtained by reaction with 
inorganic or organic bases such as sodium hydroxide, magnesium hydroxide, ammonia, 

25 trialkylamine, dialkylamine, monoalkylamine, dibasic amino acids, sodium acetate, 
potassium benzoate, Methanol amine and the like. 

The pharmaceutical composition of the invention may be in the form of a 
complex of the protein(s) or other active ingredient(s) of present invention along with 
protein or peptide antigens. The protein and/or peptide antigen will deliver a stimulatory 

30 signal to both B and T lymphocytes. B lymphocytes will respond to antigen through their 
surface immunoglobulin receptor. T lymphocytes will respond to antigen through the T 
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cell receptor (TCR) following presentation of the antigen by MHC proteins. MHC and 
structurally related proteins including those encoded by class I and class II MHC genes 
on host cells will serve to present the peptide antigen(s) to T lymphocytes. The antigen 
components could also be supplied as purified MHC-peptide complexes alone or with 
5 co-stimulatory molecules that can directly signal T cells. Alternatively antibodies able to 
bind surface immunoglobulin and other molecules on B cells as well as antibodies able to 
bind the TCR and other molecules on T cells can be combined with the pharmaceutical 
composition of the invention. 

The pharmaceutical composition of the invention may be in the form of a 

10 liposome in which protein of the present invention is combined, in addition to other 

pharmaceutically acceptable carriers, with amphipathic agents such as lipids which exist 
in aggregated form as micelles, insoluble monolayers, liquid crystals, or lamellar layers 
in aqueous solution. Suitable lipids for liposomal formulation include, without limitation, 
monoglycerides, diglycerides, sulfatides, lysolecithins, phospholipids, saponin, bile acids, 

15 and the like. Preparation of such liposomal formulations is within the level of skill in the 
art, as disclosed, for example, in U.S. Patent Nos. 4,235,871; 4,501,728; 4,837,028; and 
4,737,323, all of which are incorporated herein by reference. 

The amount of protein or other active ingredient of the present invention in the 
pharmaceutical composition of the present invention will depend upon the nature and 

20 severity of the condition being treated, and on the nature of prior treatments which the 
patient has undergone. Ultimately, the attending physician will decide the amount of 
protein or other active ingredient of the present invention with which to treat each 
individual patient. Initially, the attending physician will administer low doses of protein 
or other active ingredient of the present invention and observe the patient's response. 

25 Larger doses of protein or other active ingredient of the present invention may be 

administered until the optimal therapeutic effect is obtained for the patient, and at that 
point the dosage is not increased further. It is contemplated that the various 
pharmaceutical compositions used to practice the method of the present invention should 
contain about 0.01 \ig to about 100 mg (preferably about 0.1 jug to about 10 mg, more 

30 preferably about 0.1 \xg to about 1 mg) of protein or other active ingredient of the present 
invention per kg body weight. For compositions of the present invention which are 
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useful for bone, cartilage, tendon or ligament regeneration, the therapeutic method 
includes administering the composition topically, systematically, or locally as an implant 
or device. When administered, the therapeutic composition for use in this invention is, of 
course, in a pyrogen-free, physiologically acceptable form. Further, the composition may 
5 desirably be encapsulated or injected in a viscous form for delivery to the site of bone, 
cartilage or tissue damage. Topical administration may be suitable for wound healing 
and tissue repair. Therapeutically useful agents other than a protein or other active 
ingredient of the invention which may also optionally be included in the composition as 
described above, may alternatively or additionally, be administered simultaneously or 

1 0 sequentially with the composition in the methods of the invention. Preferably for bone 
and/or cartilage formation, the composition would include a matrix capable of delivering 
the protein-containing or other active ingredient-containing composition to the site of 
bone and/or cartilage damage, providing a structure for the developing bone and cartilage 
and optimally capable of being resorbed into the body. Such matrices may be formed of 

1 5 materials presently in use for other implanted medical applications. 

The choice of matrix material is based on biocompatibility, biodegradability, 
mechanical properties, cosmetic appearance and interface properties. The particular 
application of the compositions will define the appropriate formulation. Potential 
matrices for the compositions may be biodegradable and chemically defined calcium 

20 sulfate, tricalcium phosphate, hydroxyapatite, polylactic acid, polyglycolic acid and 
polyanhydrides. Other potential materials are biodegradable and biologically 
well-defined, such as bone or dermal collagen. Further matrices are comprised of pure 
proteins or extracellular matrix components. Other potential matrices are 
nonbiodegradable and chemically defined, such as sintered hydroxyapatite, bioglass, 

25 aluminates, or other ceramics. Matrices may be comprised of combinations of any of the 
above mentioned types of material, such as polylactic acid and hydroxyapatite or 
collagen and tricalcium phosphate. The bioceramics may be altered in composition, such 
as in calcium-aluminate-phosphate and processing to alter pore size, particle size, particle 
shape, and biodegradability. Presently preferred is a 50:50 (mole weight) copolymer of 

30 lactic acid and glycolic acid in the form of porous particles having diameters ranging 
from 150 to 800 microns. In some applications, it will be useful to utilize a sequestering 
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agent, such as carboxymethyl cellulose or autologous blood clot, to prevent the protein 
compositions from disassociating from the matrix. 

A preferred family of sequestering agents is cellulosic materials such as 
alkylcelluloses (including hydroxyalkylcelluloses), including methylcellulose, 
5 ethylcellulose, hydroxyethylcellulose, hydroxypropylcellulose, 

hydroxypropyl-methylcellulose, and carboxymethylcellulose, the most preferred being 
cationic salts of carboxymethylcellulose (CMC). Other preferred sequestering agents 
include hyaluronic acid, sodium alginate, poly(ethylene glycol), polyoxyethylene oxide, 
carboxyvinyl polymer and polyvinyl alcohol). The amount of sequestering agent useful 

10 herein is 0.5-20 wt %, preferably 1-10 wt % based on total formulation weight, which 
represents the amount necessary to prevent desorption of the protein from the polymer 
matrix and to provide appropriate handling of the composition, yet not so much that the 
progenitor cells are prevented from infiltrating the matrix, thereby providing the protein 
the opportunity to assist the osteogenic activity of the progenitor cells. In further 

15 compositions, proteins or other active ingredients of the invention may be combined with 
other agents beneficial to the treatment of the bone and/or cartilage defect, wound, or 
tissue in question. These agents include various growth factors such as epidermal growth 
factor (EGF), platelet derived growth factor (PDGF), transforming growth factors 
(TGF-a and TGF-p), and insulin-like growth factor (IGF). 

20 The therapeutic compositions are also presently valuable for veterinary 

applications. Particularly domestic animals and thoroughbred horses, in addition to 
humans, are desired patients for such treatment with proteins or other active ingredients 
of the present invention. The dosage regimen of a protein-containing pharmaceutical 
composition to be used in tissue regeneration will be determined by the attending 

25 physician considering various factors which modify the action of the proteins, e.g., 
amount of tissue weight desired to be formed, the site of damage, the condition of the 
damaged tissue, the size of a wound, type of damaged tissue {e.g., bone), the patient's 
age, sex, and diet, the severity of any infection, time of administration and other clinical 
factors. The dosage may vary with the type of matrix used in the reconstitution and with 

30 inclusion of other proteins in the pharmaceutical composition. For example, the addition 
of other known growth factors, such as IGF I (insulin like growth factor I), to the final 
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composition, may also effect the dosage. Progress can be monitored by periodic 
assessment of tissue/bone growth and/or repair, for example, X-rays, histomorphometric 
determinations and tetracycline labeling. 

Polynucleotides of the present invention can also be used for gene therapy. Such 
5 polynucleotides can be introduced either in vivo or ex vivo into cells for expression in a 
mammalian subject. Polynucleotides of the invention may also be administered by other 
known methods for introduction of nucleic acid into a cell or organism (including, 
without limitation, in the form of viral vectors or naked DNA). Cells may also be 
cultured ex vivo in the presence of proteins of the present invention in order to proliferate 
10 or to produce a desired effect on or activity in such cells. Treated cells can then be 
introduced in vivo for therapeutic purposes. 

4.9.3 EFFECTIVE DOSAGE 

Pharmaceutical compositions suitable for use in the present invention include 

1 5 compositions wherein the active ingredients are contained in an effective amount to 
achieve its intended purpose. More specifically, a therapeutically effective amount 
means an amount effective to prevent development of or to alleviate the existing 
symptoms of the subject being treated. Determination of the effective amount is well 
within the capability of those skilled in the art, especially in light of the detailed 

20 disclosure provided herein. For any compound used in the method of the invention, the 
therapeutically effective dose can be estimated initially from appropriate in vitro assays. 
For example, a dose can be formulated in animal models to achieve a circulating 
concentration range that can be used to more accurately determine useful doses in 
humans. For example, a dose can be formulated in animal models to achieve a 

25 circulating concentration range that includes the IC50 as determined in cell culture (i.e., 
the concentration of the test compound which achieves a half-maximal inhibition of the 
protein's biological activity). Such information can be used to more accurately determine 
useful doses in humans. 

A therapeutically effective dose refers to that amount of the compound that results 

30 in amelioration of symptoms or a prolongation of survival in a patient. Toxicity and 
therapeutic efficacy of such compounds can be determined by standard pharmaceutical 
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procedures in cell cultures or experimental animals, e.g., for determining the LD50 (the 
dose lethal to 50% of the population) and the ED50 (the dose therapeutically effective in 
50% of the population). The dose ratio between toxic and therapeutic effects is the 
therapeutic index and it can be expressed as the ratio between LD50 and ED50. 
5 Compounds which exhibit high therapeutic indices are preferred. The data obtained from 
these cell culture assays and animal studies can be used in formulating a range of dosage 
for use in human. The dosage of such compounds lies preferably within a range of 
circulating concentrations that include the ED50 with little or no toxicity. The dosage 
may vary within this range depending upon the dosage form employed and the route of 

10 administration utilized. The exact formulation, route of administration and dosage can be 
chosen by the individual physician in view of the patient's condition. See, e.g., Fingl et 
al., 1975, in "The Pharmacological Basis of Therapeutics", Ch. 1 p.l. Dosage amount 
and interval may be adjusted individually to provide plasma levels of the active moiety 
which are sufficient to maintain the desired effects, or minimal effective concentration 

15 (MEC). The MEC will vary for each compound but can be estimated from in vitro data. 
Dosages necessary to achieve the MEC will depend on individual characteristics and 
route of administration. However, HPLC assays or bioassays can be used to determine 
plasma concentrations. 

Dosage intervals can also be determined using MEC value. Compounds should 

20 be administered using a regimen which maintains plasma levels above the MEC for 
10-90% of the time, preferably between 30-90% and most preferably between 50-90%. 
In cases of local administration or selective uptake, the effective local concentration of 
the drug may not be related to plasma concentration. 

An exemplary dosage regimen for polypeptides or other compositions of the 

25 invention will be in the range of about 0.01 j-ig/kg to 100 mg/kg of body weight daily, 
with the preferred dose being about 0.1 ]ug/kg to 25 mg/kg of patient body weight daily, 
varying in adults and children. Dosing may be once daily, or equivalent doses may be 
delivered at longer or shorter intervals. 

The amount of composition administered will, of course, be dependent on the 

30 subject being treated, on the subject's age and weight, the severity of the affliction, the 
manner of administration and the judgment of the prescribing physician. 
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4.9.4 PACKAGING 

The compositions may, if desired, be presented in a pack or dispenser device 
which may contain one or more unit dosage forms containing the active ingredient. The 
pack may, for example, comprise metal or plastic foil, such as a blister pack. The pack or 
dispenser device may be accompanied by instructions for administration. Compositions 
comprising a compound of the invention formulated in a compatible pharmaceutical 
carrier may also be prepared, placed in an appropriate container, and labeled for 
treatment of an indicated condition. 



4.10 ANTIBODIES 

Another aspect of the invention is an antibody that specifically binds the 
polypeptide of the invention. Such antibodies include monoclonal and polyclonal 
antibodies, single chain antibodies, chimeric antibodies, bifunctional/bispecific 

15 antibodies, humanized antibodies, human antibodies, and complementary determining 
region (CDR)-grafted antibodies, including compounds which include CDR and/or 
antigen-binding sequences, which specifically recognize a polypeptide of the invention. 
Preferred antibodies of the invention are human antibodies which are produced and 
identified according to methods described in W093/1 1236, published June 20, 1993, 

20 which is incorporated herein by reference in its entirety. Antibody fragments, including 
Fab, Fab', F(ab')2, and F v , are also provided by the invention. The term "specific for" 
indicates that the variable regions of the antibodies of the invention recognize and bind 
polypeptides of the invention exclusively (i.e., able to distinguish the polypeptide of the 
invention from other similar polypeptides despite sequence identity, homology, or 

25 similarity found in the family of polypeptides), but may also interact with other proteins 
(for example, S. aureus protein A or other antibodies in ELISA techniques) through 
interactions with sequences outside the variable region of the antibodies, and in 
particular, in the constant region of the molecule. Screening assays to determine binding 
specificity of an antibody of the invention are well known and routinely practiced in the 

30 art. For a comprehensive discussion of such assays, see Harlow et al. (Eds), Antibodies 
A Laboratory Manual; Cold Spring Harbor Laboratory; Cold Spring Harbor , NY (1988), 
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Chapter 6. Antibodies that recognize and bind fragments of the polypeptides of the 
invention are also contemplated, provided that the antibodies are first and foremost 
specific for, as defined above, full length polypeptides of the invention. As with 
antibodies that are specific for full length polypeptides of the invention, antibodies of the 
5 invention that recognize fragments are those which can distinguish polypeptides from the 
same family of polypeptides despite inherent sequence identity, homology, or similarity 
found in the family of proteins. Antibodies of the invention can be produced using any 
method well known and routinely practiced in the art. 

Non-human antibodies may be humanized by any methods known in the art. In 

10 one method, the non-human CDRs are inserted into a human antibody or consensus 

antibody framework sequence. Further changes can then be introduced into the antibody 
framework to modulate affinity or immunogenicity. 

Antibodies of the invention are useful for, for example, therapeutic purposes (by 
modulating activity of a polypeptide of the invention), diagnostic purposes to detect or 

15 quantitate a polypeptide of the invention, as well as purification of a polypeptide of the 
invention. Kits comprising an antibody of the invention for any of the purposes 
described herein are also comprehended. In general, a kit of the invention also includes a 
control antigen for which the antibody is immunospecific. The invention further provides 
a hybridoma that produces an antibody according to the invention. Antibodies of the 

20 invention are useful for detection and/or purification of the polypeptides of the invention. 

Polypeptides of the invention may also be used to immunize animals to obtain 
polyclonal and monoclonal antibodies which specifically react with the protein. Such 
antibodies may be obtained using either the entire protein or fragments thereof as an 
immunogen. The peptide immunogens additionally may contain a cysteine residue at the 

25 carboxyl terminus, and are conjugated to a hapten such as keyhole limpet hemocyanin 
(KLH). Methods for synthesizing such peptides are known in the art, for example, as in 
R. P. Merrifield, J. Amer. Chem. Soc. 85, 2149-2154 (1963); J. L. Krstenansky, et al., 
FEBS Lett. 211, 10(1987). 

Monoclonal antibodies binding to the protein of the invention may be useful 

30 diagnostic agents for the immunodetection of the protein. Neutralizing monoclonal 
antibodies binding to the protein may also be useful therapeutics for both conditions 
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associated with the protein and also in the treatment of some forms of cancer where 
abnormal expression of the protein is involved. In the case of cancerous cells or 
leukemic cells, neutralizing monoclonal antibodies against the protein may be useful in 
detecting and preventing the metastatic spread of the cancerous cells, which may be 
5 mediated by the protein. In general, techniques for preparing polyclonal and monoclonal 
antibodies as well as hybridomas capable of producing the desired antibody are well 
known in the art (Campbell, A.M., Monoclonal Antibodies Technology: Laboratory 
Techniques in Biochemistry and Molecular Biology, Elsevier Science Publishers, 
Amsterdam, The Netherlands (1984); St. Groth et al., J. Immunol. 35:1-21 (1990); Kohler 

1 0 and Milstein, Nature 256:495-497 (1975)), the trioma technique, the human B-cell 
hybridoma technique (Kozbor et al., Immunology Today 4:72 (1983); Cole et ah, in 
Monoclonal Antibodies and Cancer Therapy, Alan R. Liss, Inc. (1985), pp. 77-96). 

Any animal (mouse, rabbit, etc.) which is known to produce antibodies can be 
immunized with a peptide or polypeptide of the invention. Methods for immunization are 

1 5 well known in the art. Such methods include subcutaneous or intraperitoneal injection of 
the polypeptide. One skilled in the art will recognize that the amount of the protein 
encoded by the ORF of the present invention used for immunization will vary based on 
the animal which is immunized, the antigenicity of the peptide and the site of injection. 
The protein that is used as an immunogen may be modified or administered in an 

20 adjuvant in order to increase the protein's antigenicity. Methods of increasing the 
antigenicity of a protein are well known in the art and include, but are not limited to, 
coupling the antigen with a heterologous protein (such as globulin or p-galactosidase) or 
through the inclusion of an adjuvant during immunization. 

For monoclonal antibodies, spleen cells from the immunized animals are 

25 removed, fused with myeloma cells, such as SP2/0-Agl4 myeloma cells, and allowed to 
become monoclonal antibody producing hybridoma cells. Any one of a number of 
methods well known in the art can be used to identify the hybridoma cell which produces 
an antibody with the desired characteristics. These include screening the hybridomas 
with an ELISA assay, Western blot analysis, or radioimmunoassay (Lutz et ah, Exp. Cell 

30 Research. 175:109-124 (1988)). Hybridomas secreting the desired antibodies are cloned 
and the class and subclass is determined using procedures known in the art (Campbell, 
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A.M., Monoclonal Antibody Technology: Laboratory Techniques in Biochemistry and 
Molecular Biology, Elsevier Science Publishers, Amsterdam, The Netherlands (1984)). 
Techniques described for the production of single chain antibodies (U.S. Patent 
4,946,778) can be adapted to produce single chain antibodies to proteins of the present 
5 invention. 

For polyclonal antibodies, antibody-containing antiserum is isolated from the 
immunized animal and is screened for the presence of antibodies with the desired 
specificity using one of the above-described procedures. The present invention further 
provides the above- described antibodies in delectably labeled form. Antibodies can be 

10 delectably labeled through the use of radioisotopes, affinity labels (such as biotin, avidin, 
etc.), enzymatic labels (such as horseradish peroxidase, alkaline phosphatase, etc.) 
fluorescent labels (such as FITC or rhodamine, etc.), paramagnetic atoms, etc. 
Procedures for accomplishing such labeling are well-known in the art, for example, see 
(Sternberger, L.A. et al., J. Histochem. Cytochem. 18:315 (1970); Bayer, E.A. et al., 

15 Meth. Enzym. 62:308 (1979); Engval, E. et al., Immunol 109:129 (1972); Goding, J.W. 
J. Immunol. Meth. 13:215 (1976)). 

The labeled antibodies of the present invention can be used for in vitro, in vivo, 
and in situ assays to identify cells or tissues in which a fragment of the polypeptide of 
interest is expressed. The antibodies may also be used directly in therapies or other 

20 diagnostics. The present invention further provides the above-described antibodies 

immobilized on a solid support. Examples of such solid supports include plastics such as 
polycarbonate, complex carbohydrates such as agarose and Sepharose®, acrylic resins 
and such as polyacrylamide and latex beads. Techniques for coupling antibodies to such 
solid supports are well known in the art (Weir, D.M. et al., "Handbook of Experimental 

25 Immunology" 4th Ed., Blackwell Scientific Publications, Oxford, England, Chapter 10 
(1986); Jacoby, W.D. et al, Meth. Enzym. 34 Academic Press, N.Y. (1974)). The 
immobilized antibodies of the present invention can be used for in vitro, in vivo, and in 
situ assays as well as for immuno-affinity purification of the proteins of the present 
invention. 

30 

4.11 COMPUTER READABLE SEQUENCES 
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In one application of this embodiment, a nucleotide sequence of the present 
invention can be recorded on computer readable media. As used herein, "computer 
readable media" refers to any medium which can be read and accessed directly by a 
computer. Such media include, but are not limited to: magnetic storage media, such as 
5 floppy discs, hard disc storage medium, and magnetic tape; optical storage media such as 
CD-ROM; electrical storage media such as RAM and ROM; and hybrids of these 
categories such as magnetic/optical storage media. A skilled artisan can readily 
appreciate how any of the presently known computer readable mediums can be used to 
create a manufacture comprising computer readable medium having recorded thereon a 

10 nucleotide sequence of the present invention. As used herein, "recorded" refers to a 
process for storing information on computer readable medium. A skilled artisan can 
readily adopt any of the presently known methods for recording information on computer 
readable medium to generate manufactures comprising the nucleotide sequence 
information of the present invention. 

15 A variety of data storage structures are available to a skilled artisan for creating a 

computer readable medium having recorded thereon a nucleotide sequence of the present 
invention. The choice of the data storage structure will generally be based on the means 
chosen to access the stored information. In addition, a variety of data processor programs 
and formats can be used to store the nucleotide sequence information of the present 

20 invention on computer readable medium. The sequence information can be represented 
in a word processing text file, formatted in commercially-available software such as 
WordPerfect and Microsoft Word, or represented in the form of an ASCII file, stored in a 
database application, such as DB2, Sybase, Oracle, or the like. A skilled artisan can 
readily adapt any number of data processor structuring formats (e.g. text file or database) 

25 in order to obtain computer readable medium having recorded thereon the nucleotide 
sequence information of the present invention. 

By providing any of the nucleotide sequences SEQ ID NOs: 1 - 35 or a 
representative fragment thereof; or a nucleotide sequence at least 95% identical to any of 
the nucleotide sequences of the SEQ ID NOs: 1 - 35 in computer readable form, a skilled 

30 artisan can routinely access the sequence information for a variety of purposes. 

Computer software is publicly available which allows a skilled artisan to access sequence 
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information provided in a computer readable medium. The examples which follow 
demonstrate how software which implements the BLAST (Altschul et al. ? J. Mol. Biol. 
215:403-410 (1990)) and BLAZE (Brutlag et al., Comp. Chem. 17:203-207 (1993)) 
search algorithms on a Sybase system is used to identify open reading frames (ORFs) 
5 within a nucleic acid sequence. Such ORFs may be protein encoding fragments and may 
be useful in producing commercially important proteins such as enzymes used in 
fermentation reactions and in the production of commercially useful metabolites. 

As used herein, "a computer-based system" refers to the hardware means, 
software means, and data storage means used to analyze the nucleotide sequence 

1 0 information of the present invention. The minimum hardware means of the 

computer-based systems of the present invention comprises a central processing unit 
(CPU), input means, output means, and data storage means. A skilled artisan can readily 
appreciate that any one of the currently available computer-based systems are suitable for 
use in the present invention. As stated above, the computer-based systems of the present 

15 invention comprise a data storage means having stored therein a nucleotide sequence of 
the present invention and the necessary hardware means and software means for 
supporting and implementing a search means. As used herein, "data storage means" 
refers to memory which can store nucleotide sequence information of the present 
invention, or a memory access means which can access manufactures having recorded 

20 thereon the nucleotide sequence information of the present invention. 

As used herein, "search means" refers to one or more programs which are 
implemented on the computer-based system to compare a target sequence or target 
structural motif with the sequence information stored within the data storage means. 
Search means are used to identify fragments or regions of a known sequence which 

25 match a particular target sequence or target motif. A variety of known algorithms are 

disclosed publicly and a variety of commercially available software for conducting search 
means are and can be used in the computer-based systems of the present invention. 
Examples of such software includes, but is not limited to, Smith-Waterman, MacPattern 
(EMBL), BLASTN and BLASTA (NPOLYPEPTIDEIA), A skilled artisan can readily 

30 recognize that any one of the available algorithms or implementing software packages for 
conducting homology searches can be adapted for use in the present computer-based 
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systems. As used herein, a "target sequence" can be any nucleic acid or amino acid 
sequence of six or more nucleotides or two or more amino acids. A skilled artisan can 
readily recognize that the longer a target sequence is, the less likely a target sequence will 
be present as a random occurrence in the database. The most preferred sequence length 
5 of a target sequence is from about 10 to 300 amino acids, more preferably from about 30 
to 100 nucleotide residues. However, it is well recognized that searches for 
commercially important fragments, such as sequence fragments involved in gene 
expression and protein processing, may be of shorter length. 

As used herein, "a target structural motif," or "target motif," refers to any 

10 rationally selected sequence or combination of sequences in which the sequence(s) are 
chosen based on a three-dimensional configuration which is formed upon the folding of 
the target motif. There are a variety of target motifs known in the art. Protein target 
motifs include, but are not limited to, enzyme active sites and signal sequences. Nucleic 
acid target motifs include, but are not limited to, promoter sequences, hairpin structures 

15 and inducible expression elements (protein binding sequences). 

4.12 TRIPLE HELIX FORMATION 

In addition, the fragments of the present invention, as broadly described, can be 
used to control gene expression through triple helix formation or antisense DNA or RNA, 

20 both of which methods are based on the binding of a polynucleotide sequence to DNA or 
RNA. Polynucleotides suitable for use in these methods are preferably 20 to 40 bases in 
length and are designed to be complementary to a region of the gene involved in 
transcription (triple helix - see Lee et al., Nucl. Acids Res. 6:3073 (1979); Cooney et al, 
Science 15241:456 (1988); and Dervan et al, Science 251:1360 (1991)) or to the mRNA 

25 itself (antisense - Olmno, J. Neurochem. 56:560 (1991); Oligodeoxynucleotides as 

Antisense Inhibitors of Gene Expression, CRC Press, Boca Raton, FL (1988)). Triple 
helix-formation optimally results in a shut-off of RNA transcription from DNA, while 
antisense RNA hybridization blocks translation of an mRNA molecule into polypeptide. 
Both techniques have been demonstrated to be effective in model systems. Information 

30 contained in the sequences of the present invention is necessary for the design of an 
antisense or triple helix oligonucleotide. 
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4.13 DIAGNOSTIC ASSAYS AND KITS 

The present invention further provides methods to identify the presence or 
expression of one of the ORFs of the present invention, or homolog thereof, in a test 
5 sample, using a nucleic acid probe or antibodies of the present invention, optionally 
conjugated or otherwise associated with a suitable label. 

In general, methods for detecting a polynucleotide of the invention can comprise 
contacting a sample with a compound that binds to and forms a complex with the 
polynucleotide for a period sufficient to form the complex, and detecting the complex, so 

10 that if a complex is detected, a polynucleotide of the invention is detected in the sample. 
Such methods can also comprise contacting a sample under stringent hybridization 
conditions with nucleic acid primers that anneal to a polynucleotide of the invention 
under such conditions, and amplifying annealed polynucleotides, so that if a 
polynucleotide is amplified, a polynucleotide of the invention is detected in the sample. 

15 In general, methods for detecting a polypeptide of the invention can comprise 

contacting a sample with a compound that binds to and forms a complex with the 
polypeptide for a period sufficient to form the complex, and detecting the complex, so 
that if a complex is detected, a polypeptide of the invention is detected in the sample. 

In detail, such methods comprise incubating a test sample with one or more of the 

20 antibodies or one or more of the nucleic acid probes of the present invention and assaying 
for binding of the nucleic acid probes or antibodies to components within the test sample. 

Conditions for incubating a nucleic acid probe or antibody with a test sample 
vary. Incubation conditions depend on the format employed in the assay, the detection 
methods employed, and the type and nature of the nucleic acid probe or antibody used in 

25 the assay. One skilled in the art will recognize that any one of the commonly available 
hybridization, amplification or immunological assay formats can readily be adapted to 
employ the nucleic acid probes or antibodies of the present invention. Examples of such 
assays can be found in Chard, T. 5 An Introduction to Radioimmunoassay and Related 
Techniques, Elsevier Science Publishers, Amsterdam, The Netherlands (1986); Bullock, 

30 G.R. et aL, Techniques in Immunocytochemistry, Academic Press, Orlando, FL Vol. 1 
(1982), Vol. 2 (1983), Vol. 3 (1985); Tijssen, P., Practice and Theory of immunoassays: 
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Laboratory Techniques in Biochemistry and Molecular Biology, Elsevier Science 
Publishers, Amsterdam, The Netherlands (1985). The test samples of the present 
invention include cells, protein or membrane extracts of cells, or biological fluids such as 
sputum, blood, serum, plasma, or urine. The test sample used in the above-described 
5 method will vary based on the assay format, nature of the detection method and the 

tissues, cells or extracts used as the sample to be assayed. Methods for preparing protein 
extracts or membrane extracts of cells are well known in the art and can be readily be 
adapted in order to obtain a sample which is compatible with the system utilized. 

In another embodiment of the present invention, kits are provided which contain 

1 0 the necessary reagents to carry out the assays of the present invention. Specifically, the 
invention provides a compartment kit to receive, in close confinement, one or more 
containers which comprises: (a) a first container comprising one of the probes or 
antibodies of the present invention; and (b) one or more other containers comprising one 
or more of the following: wash reagents, reagents capable of detecting presence of a 

1 5 bound probe or antibody. 

In detail, a compartment kit includes any kit in which reagents are contained in 
separate containers. Such containers include small glass containers, plastic containers or 
strips of plastic or paper. Such containers allows one to efficiently transfer reagents from 
one compartment to another compartment such that the samples and reagents are not 

20 cross-contaminated, and the agents or solutions of each container can be added in a 
quantitative fashion from one compartment to another. Such containers will include a 
container which will accept the test sample, a container which contains the antibodies 
used in the assay, containers which contain wash reagents (such as phosphate buffered 
saline, Tris-buffers, etc.), and containers which contain the reagents used to detect the 

25 bound antibody or probe. Types of detection reagents include labeled nucleic acid probes, 
labeled secondary antibodies, or in the alternative, if the primary antibody is labeled, the 
enzymatic, or antibody binding reagents which are capable of reacting with the labeled 
antibody. One skilled in the art will readily recognize that the disclosed probes and 
antibodies of the present invention can be readily incorporated into one of the established 

30 kit formats which are well known in the art. 
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4.14 MEDICAL IMAGING 

The novel polypeptides and binding partners of the invention are useful in 
medical imaging of sites expressing the molecules of the invention (e.g., where the 
polypeptide of the invention is involved in the immune response, for imaging sites of 
5 inflammation or infection). See, e.g., Kunkel et al., U.S. Pat. NO. 5,413,778, Such 
methods involve chemical attachment of a labeling or imaging agent, administration of 
the labeled polypeptide to a subject in a pharmaceutically acceptable carrier, and imaging 
the labeled polypeptide in vivo at the target site. 

10 4.15 SCREENING ASSAYS 

Using the isolated proteins and polynucleotides of the invention, the present 
invention further provides methods of obtaining and identifying agents which bind to a 
polypeptide encoded by an ORF corresponding to any of the nucleotide sequences set 
forth in the SEQ ID NOs: 1 - 35, or bind to a specific domain of the polypeptide encoded 
1 5 by the nucleic acid. In detail, said method comprises the steps of: 

(a) contacting an agent with an isolated protein encoded by an ORF of the 
present invention, or nucleic acid of the invention; and 

(b) determining whether the agent binds to said protein or said nucleic acid. 
In general, therefore, such methods for identifying compounds that bind to a 

20 polynucleotide of the invention can comprise contacting a compound with a 

polynucleotide of the invention for a time sufficient to form a polynucleotide/compound 
complex, and detecting the complex, so that if a polynucleotide/compound complex is 
detected, a compound that binds to a polynucleotide of the invention is identified. 

Likewise, in general, therefore, such methods for identifying compounds that bind 

25 to a polypeptide of the invention can comprise contacting a compound with a polypeptide 
of the invention for a time sufficient to form a polypeptide/compound complex, and 
detecting the complex, so that if a polypeptide/compound complex is detected, a 
compound that binds to a polynucleotide of the invention is identified. 

Methods for identifying compounds that bind to a polypeptide of the invention 

30 can also comprise contacting a compound with a polypeptide of the invention in a cell for 
a time sufficient to form a polypeptide/compound complex, wherein the complex drives 
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expression of a receptor gene sequence in the cell, and detecting the complex by 
detecting reporter gene sequence expression, so that if a polyp eptide/compound complex 
is detected, a compound that binds a polypeptide of the invention is identified. 

Compounds identified via such methods can include compounds which modulate 
5 the activity of a polypeptide of the invention (that is, increase or decrease its activity, 
relative to activity observed in the absence of the compound). Alternatively, compounds 
identified via such methods can include compounds which modulate the expression of a 
polynucleotide of the invention (that is, increase or decrease expression relative to 
expression levels observed in the absence of the compound). Compounds, such as 
10 compounds identified via the methods of the invention, can be tested using standard 
assays well known to those of skill in the art for their ability to modulate 
activity/expression. 

The agents screened in the above assay can be, but are not limited to, peptides, 
carbohydrates, vitamin derivatives, or other pharmaceutical agents. The agents can be 
1 5 selected and screened at random or rationally selected or designed using protein modeling 
techniques. 

For random screening, agents such as peptides, carbohydrates, pharmaceutical 
agents and the like are selected at random and are assayed for their ability to bind to the 
protein encoded by the ORF of the present invention. Alternatively, agents may be 

20 rationally selected or designed. As used herein, an agent is said to be "rationally selected 
or designed" when the agent is chosen based on the configuration of the particular 
protein. For example, one skilled in the art can readily adapt currently available 
procedures to generate peptides, pharmaceutical agents and the like, capable of binding to 
a specific peptide sequence, in order to generate rationally designed antipeptide peptides, 

25 for example see Hurby et al., Application of Synthetic Peptides: Antisense Peptides," In 
Synthetic Peptides, A User's Guide, W.H. Freeman, NY (1992), pp. 289-307, and 
Kaspczak et al, Biochemistry 28:9230-8 (1989), or pharmaceutical agents, or the like. 

In addition to the foregoing, one class of agents of the present invention, as 
broadly described, can be used to control gene expression through binding to one of the 

30 ORFs or EMFs of the present invention. As described above, such agents can be 

randomly screened or rationally designed/selected. Targeting the ORF or EMF allows a 
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skilled artisan to design sequence specific or element specific agents, modulating the 
expression of either a single ORF or multiple ORFs which rely on the same EMF for 
expression control. One class of DNA binding agents are agents which contain base 
residues which hybridize or form a triple helix formation by binding to DNA or RNA. 
5 Such agents can be based on the classic phosphodiester, ribonucleic acid backbone, or 
can be a variety of sulfhydryl or polymeric derivatives which have base attachment 
capacity. 

Agents suitable for use in these methods preferably contain 20 to 40 bases and are 
designed to be complementary to a region of the gene involved in transcription (triple 

10 helix - see Lee et al, Nucl. Acids Res. 6:3073 (1979); Cooney et al., Science 241 :456 
(1988); and Dervan et al., Science 251:1360 (1991)) or to the mRNA itself (antisense - 
Okano, J. Neurochem. 56:560 (1991); Oligodeoxynucleotides as Antisense Inhibitors of 
Gene Expression, CRC Press, Boca Raton, FL (1988)). Triple helix- formation optimally 
results in a shut-off of RNA transcription from DNA, while antisense RNA hybridization 

1 5 blocks translation of an mRNA molecule into polypeptide. Both techniques have been 
demonstrated to be effective in model systems. Information contained in the sequences 
of the present invention is necessary for the design of an antisense or triple helix 
oligonucleotide and other DNA binding agents. 

Agents which bind to a protein encoded by one of the ORFs of the present 

20 invention can be used as a diagnostic agent. Agents which bind to a protein encoded by 
one of the ORFs of the present invention can be formulated using known techniques to 
generate a pharmaceutical composition. 

4.16 USE OF NUCLEIC ACIDS AS PROBES 

25 Another aspect of the subject invention is to provide for polypeptide-specific 

nucleic acid hybridization probes capable of hybridizing with naturally occurring 
nucleotide sequences. The hybridization probes of the subject invention may be derived 
from any of the nucleotide sequences SEQ ID NOs: 1-35. Because the corresponding 
gene is only expressed in a limited number of tissues, a hybridization probe derived from 

30 of any of the nucleotide sequences SEQ ID NOs: 1 - 35 can be used as an indicator of the 
presence of RNA of cell type of such a tissue in a sample. 
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Any suitable hybridization technique can be employed, such as, for example, in 
situ hybridization. PCR as described in US Patents Nos. 4,683,195 and 4,965,188 
provides additional uses for oligonucleotides based upon the nucleotide sequences. Such 
probes used in PCR may be of recombinant origin, may be chemically synthesized, or a 
5 mixture of both. The probe will comprise a discrete nucleotide sequence for the detection 
of identical sequences or a degenerate pool of possible sequences for identification of 
closely related genomic sequences. 

Other means for producing specific hybridization probes for nucleic acids include 
the cloning of nucleic acid sequences into vectors for the production of mRNA probes. 

10 Such vectors are known in the art and are commercially available and may be used to 
synthesize RNA probes in vitro by means of the addition of the appropriate RNA 
polymerase as T7 or SP6 RNA polymerase and the appropriate radioactively labeled 
nucleotides. The nucleotide sequences may be used to construct hybridization probes for 
mapping their respective genomic sequences. The nucleotide sequence provided herein 

15 may be mapped to a chromosome or specific regions of a chromosome using well known 
genetic and/or chromosomal mapping techniques. These techniques include in situ 
hybridization, linkage analysis against known chromosomal markers, hybridization 
screening with libraries or flow-sorted chromosomal preparations specific to known 
chromosomes, and the like. The technique of fluorescent in situ hybridization of 

20 chromosome spreads has been described, among other places, in Verma et al (1988) 

Human Chromosomes: A Manual of Basic Techniques, Pergamon Press, New York NY. 

Fluorescent in situ hybridization of chromosomal preparations and other physical 
chromosome mapping techniques may be correlated with additional genetic map data. 
Examples of genetic map data can be found in the 1994 Genome Issue of Science 

25 (265; 198 If). Correlation between the location of a nucleic acid on a physical 

chromosomal map and a specific disease (or predisposition to a specific disease) may 
help delimit the region of DNA associated with that genetic disease. The nucleotide 
sequences of the subject invention maybe used to detect differences in gene sequences 
between normal, carrier or affected individuals. 
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4.17 PREPARATION OF SUPPORT BOUND OLIGONUCLEOTIDES 

Oligonucleotides, i.e., small nucleic acid segments, may be readily prepared by, for 
example, directly synthesizing the oligonucleotide by chemical means, as is commonly 
practiced using an automated oligonucleotide synthesizer. 
5 Support bound oligonucleotides may be prepared by any of the methods known to 

those of skill in the art using any suitable support such as glass, polystyrene or Teflon. One 
strategy is to precisely spot oligonucleotides synthesized by standard synthesizers. 
Immobilization can be achieved using passive adsorption (Inouye & Hondo, (1990) J. Clin. 
Microbiol. 28(6) 1469-72); using UV light (Nagata et al, 1985; Dahlen et al, 1987; 
10 Morrissey & Collins, (1989) Mol. Cell Probes 3(2) 189-207) or by covalent binding of base 
modified DNA (Keller et al, 1988; 1989); all references being specifically incorporated 
herein. 

Another strategy that may be employed is the use of the strong biotin-streptavidin 
interaction as a linker. For example, Broude et al (1994) Proc. Natl. Acad. Sci. USA 91(8) 

1 5 3072-6, describe the use of biotinylated probes, although these are duplex probes, that are 
immobilized on streptavidin-coated magnetic beads. Streptavidin-coated beads may be 
purchased from Dynal, Oslo. Of course, this same linking chemistry is applicable to coating 
any surface with streptavidin. Biotinylated probes may be purchased from various sources, 
such as, e.g., Operon Technologies (Alameda, CA). 

20 Nunc Laboratories (Naperville, IL) is also selling suitable material that could be 

used. Nunc Laboratories have developed a method by which DNA can be covalently bound 
to the microwell surface termed Covalink NH. CovaLink NH is a polystyrene surface 
grafted with secondary amino groups (>NH) that serve as bridge-heads for further covalent 
coupling. CovaLink Modules may be purchased from Nunc Laboratories. DNA molecules 

25 may be bound to CovaLink exclusively at the 5-end by a phosphoramidate bond, allowing 
immobilization of more than 1 pmol of DNA (Rasmussen et al, (1991) Anal Biochem. 
198(1) 138-42). 

The use of CovaLink NH strips for covalent binding of DNA molecules at the 5-end 
has been described (Rasmussen et al., (1991). In this technology, a phosphoramidate bond 
30 is employed (Chu et al., (1983) Nucleic Acids Res. 11(8) 6513-29). This is beneficial as 
immobilization using only a single covalent bond is preferred. The phosphoramidate bond 
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joins the DNA to the CovaLink NH secondary amino groups that are positioned at the end 
of spacer arms covalently grafted onto the polystyrene surface through a 2 nm long spacer 
arm. To link an oligonucleotide to CovaLink NH via an phosphoramidate bond, the 
oligonucleotide terminus must have a 5-end phosphate group. It is, perhaps, even possible 
5 for biotin to be covalently bound to CovaLink and then streptavidin used to bind the probes. 

More specifically, the linkage method includes dissolving DNA in water (7.5 ng/ul) 
and denaturing for 10 min. at 95°C and cooling on ice for 10 min. Ice-cold 0.1 M 
1-methylimidazole, pH 7.0 (l-Melmy), is then added to a final concentration of 10 mM 
l-Melmy. A ss DNA solution is then dispensed into CovaLink NH strips (75 ul/well) 

10 standing on ice. 

Carbodiimide 0.2 M l-ethyl-3-(3-dimethylaminopropyl)-carbodiimide (EDC), 
dissolved in 10 mM l-Melmy, is made fresh and 25 ul added per well. The strips are 
incubated for 5 hours at 50°C. After incubation the strips are washed using, e.g., 
Nunc-Immuno Wash; first the wells are washed 3 times, then they are soaked with washing 

15 solution for 5 min., and finally they are washed 3 times (where in the washing solution is 0.4 
N NaOH, 0.25% SDS heated to 50°C). 

It is contemplated that a further suitable method for use with the present invention is 
that described in PCT Patent Application WO 90/03382 (Southern & Maskos), incorporated 
herein by reference. This method of preparing an oligonucleotide bound to a support 

20 involves attaching a nucleoside 3'-reagent through the phosphate group by a covalent 
phosphodiester link to aliphatic hydroxyl groups carried by the support. The 
oligonucleotide is then synthesized on the supported nucleoside and protecting groups 
removed from the synthetic oligonucleotide chain under standard conditions that do not 
cleave the oligonucleotide from the support. Suitable reagents include nucleoside 

25 phosphoramidite and nucleoside hydrogen phosphorate. 

An on-chip strategy for the preparation of DNA probe for the preparation of DNA 
probe arrays may be employed. For example, addressable laser-activated photodeprotection 
may be employed in the chemical synthesis of oligonucleotides directly on a glass surface, 
as described by Fodor et al (1991) Science 251(4995) 767-73, incorporated herein by 

30 reference. Probes may also be immobilized on nylon supports as described by Van Ness et 
al (1991) Nucleic Acids Res. 19(12) 3345-50; or linked to Teflon using the method of 
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Duncan & Cavalier (1988) Anal. Biochem. 169(1) 104-8; all references being specifically 
incorporated herein. 

To link an oligonucleotide to a nylon support, as described by Van Ness et al 
(1991), requires activation of the nylon surface via alkylation and selective activation of the 
5 5 r -amine of oligonucleotides with cyanuric chloride. 

One particular way to prepare support bound oligonucleotides is to utilize the 
light-generated synthesis described by Pease et al, (1994) PNAS USA 91(1 1) 5022-6, 
incorporated herein by reference). These authors used current photolithographic techniques 
to generate arrays of immobilized oligonucleotide probes (DNA chips). These methods, in 
1 0 which light is used to direct the synthesis of oligonucleotide probes in high-density, 
miniaturized arrays, utilize photolabile 5-protected N-acyl-deoxynucleoside 
phosphoramidites, surface linker chemistry and versatile combinatorial synthesis strategies. 
A matrix of 256 spatially defined oligonucleotide probes may be generated in this manner. 

4.18 PREPARATION OF NUCLEIC ACID FRAGMENTS 

1 5 The nucleic acids may be obtained from any appropriate source, such as cDNAs, 

genomic DNA, chromosomal DNA, microdissected chromosome bands, cosmid or YAC 
inserts, and RNA, including mRNA without any amplification steps. For example, 
Sambrook et al (1989) describes three protocols for the isolation of high molecular weight 
DNA from mammalian cells (p. 9.14-9.23). 

20 DNA fragments may be prepared as clones in M13, plasmid or lambda vectors 

and/or prepared directly from genomic DNA or cDNA by PCR or other amplification 
methods. Samples may be prepared or dispensed in multiwell plates. About 100-1000 ng of 
DNA samples may be prepared in 2-500 ml of final volume. 

The nucleic acids would then be fragmented by any of the methods known to those 

25 of skill in the art including, for example, using restriction enzymes as described at 9.24-9.28 
of Sambrook et al (1989), shearing by ultrasound and NaOH treatment. 

Low pressure shearing is also appropriate, as described by Schriefer et al (1990) 
Nucleic Acids Res. 18(24) 7455-6, incorporated herein by reference). In this method, DNA 
samples are passed through a small French pressure cell at a variety of low to intermediate 

30 pressures. A lever device allows controlled application of low to intermediate pressures to 

789CIP2C 98 



the cell. The results of these studies indicate that low-pressure shearing is a useful 

alternative to sonic and enzymatic DNA fragmentation methods. 

One particularly suitable way for fragmenting DNA is contemplated to be that using 

the two base recognition endonuclease, Cv/JI, described by Fitzgerald et al (1992) Nucleic 
5 Acids Res. 20(14) 3753-62. These authors described an approach for the rapid 

fragmentation and fractionation of DNA into particular sizes that they contemplated to be 

suitable for shotgun cloning and sequencing. 

The restriction endonuclease CV/JI normally cleaves the recognition sequence 

PuGCPy between the G and C to leave blunt ends. Atypical reaction conditions, which alter 
10 the specificity of this enzyme (CvzJI**), yield a quasi-random distribution of DNA 

fragments form the small molecule pUC19 (2688 base pairs). Fitzgerald et al (1992) 

quantitatively evaluated the randomness of this fragmentation strategy, using a Cv/JI** 

digest of pUC19 that was size fractionated by a rapid gel filtration method and directly 

ligated, without end repair, to a lac Z minus M13 cloning vector. Sequence analysis of 76 
1 5 clones showed that Cv/JI** restricts pyGCPy and PuGCPu, in addition to PuGCPy sites, and 

that new sequence data is accumulated at a rate consistent with random fragmentation. 

As reported in the literature, advantages of this approach compared to sonication and 

agarose gel fractionation include: smaller amounts of DNA are required (0.2-0.5 ug instead 

of 2-5 ug); and fewer steps are involved (no preligation, end repair, chemical extraction, or 
20 agarose gel electrophoresis and elution are needed 

Irrespective of the manner in which the nucleic acid fragments are obtained or 

prepared, it is important to denature the DNA to give single stranded pieces available for 

hybridization. This is achieved by incubating the DNA solution for 2-5 minutes at 80-90°C. 

The solution is then cooled quickly to 2°C to prevent renaturation of the DNA fragments 
25 before they are contacted with the chip. Phosphate groups must also be removed from 

genomic DNA by methods known in the art. 

4.19 PREPARATION OF DNA ARRAYS 

Arrays may be prepared by spotting DNA samples on a support such as a nylon 
membrane. Spotting may be performed by using arrays of metal pins (the positions of which 
30 correspond to an array of wells in a microliter plate) to repeated by transfer of about 20 nl of 
a DNA solution to a nylon membrane. By offset printing, a density of dots higher than the 
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density of the wells is achieved. One to 25 dots may be accommodated in 1 mm , 
depending on the type of label used. By avoiding spotting in some preselected number of 
rows and columns, separate subsets (subarrays) may be formed. Samples in one subarray 
may be the same genomic segment of DNA (or the same gene) from different individuals, or 
5 may be different, overlapped genomic clones. Each of the subarrays may represent replica 
spotting of the same samples. In one example, a selected gene segment maybe amplified 
from 64 patients. For each patient, the amplified gene segment may be in one 96-well plate 
(all 96 wells containing the same sample). A plate for each of the 64 patients is prepared. By 
using a 96-pin device, all samples maybe spotted on one 8 x 12 cm membrane. Subarrays 

1 0 may contain 64 samples, one from each patient. Where the 96 subarrays are identical, the 
dot span may be 1 mm 2 and there may be a 1 mm space between subarrays. 

Another approach is to use membranes or plates (available from NUNC, Naperville, 
Illinois) which may be partitioned by physical spacers e.g. a plastic grid molded over the 
membrane, the grid being similar to the sort of membrane applied to the bottom of multiwell 

1 5 plates, or hydrophobic strips. A fixed physical spacer is not preferred for imaging by 
exposure to flat phosphor-storage screens or x-ray films. 

The present invention is illustrated in the following examples. Upon consideration 
of the present disclosure, one of skill in the art will appreciate that many other embodiments 
and variations may be made in the scope of the present invention. Accordingly, it is 

20 intended that the broader aspects of the present invention not be limited to the disclosure of 
the following examples. The present invention is not to be limited in scope by the 
exemplified embodiments which are intended as illustrations of single aspects of the 
invention, and compositions and methods which are functionally equivalent are within the 
scope of the invention. Indeed, numerous modifications and variations in the practice of the 

25 invention are expected to occur to those skilled in the art upon consideration of the present 
preferred embodiments. Consequently, the only limitations which should be placed upon 
the scope of the invention are those which appear in the appended claims. 

All references cited within the body of the instant specification are hereby 
incorporated by reference in their entirety. 
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5.0 EXAMPLES 



5.1 EXAMPLE 1 

Novel Nucleic Acid Sequences Obtained From Various Libraries 

A plurality of novel nucleic acids were obtained from cDNA libraries prepared from 
5 various human tissues and in some cases isolated from a genomic library derived from 
human chromosome using standard PCR, SBH sequence signature analysis and Sanger 
sequencing techniques. The inserts of the library were amplified with PCR using primers 
specific for the vector sequences which flank the inserts. Clones from cDNA libraries were 
spotted on nylon membrane filters and screened with oligonucleotide probes (e.g., 7-mers) 
10 to obtain signature sequences. The clones were clustered into groups of similar or identical 
sequences. Representative clones were selected for sequencing. 

In some cases, the 5' sequence of the amplified inserts was then deduced using a 
typical Sanger sequencing protocol. PCR products were purified and subjected to 
fluorescent dye terminator cycle sequencing. Single pass gel sequencing was done using a 
1 5 377 Applied Biosystems (ABI) sequencer to obtain the novel nucleic acid sequences. In 

some cases RACE (Random Amplification of cDNA Ends) was performed to further extend 
the sequence in the 5' direction. 

5.2 EXAMPLE 2 
Novel Nucleic Acids 

20 The novel nucleic acids of the present invention of the invention were assembled 

from sequences that were obtained from a cDNA library by methods described in Example 
1 above, and in some cases sequences obtained from one or more public databases. The 
nucleic acids were assembled using an EST sequence as a seed. Then a recursive algorithm 
was used to extend the seed EST into an extended assemblage, by pulling additional 

25 sequences from different databases (i.e., Hyseq's database containing EST sequences, 

dbEST version 1 14, gb pri 1 14, and UniGene version 101) that belong to this assemblage. 
The algorithm terminated when there was no additional sequences from the above databases 
that would extend the assemblage. Inclusion of component sequences into the assemblage 
was based on a BLASTN hit to the extending assemblage with BLAST score greater than 

30 300 and percent identity greater than 95%. 
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Using PHRAP (Univ. of Washington) or CAP4 (Paracel), a full length gene cDNA 
sequence and its corresponding protein sequence were generated from the assemblage. Any 
frame shifts and incorrect stop codons were corrected by hand editing. During editing, the 
sequence was checked using FASTY and/or BLAST against Genbank (i.e., dbEST version 
5 1 18, gb pri 1 18, UniGene version 118, Genepet release 118). Other computer programs 
which may have been used in the editing process were phredPhrap and Consed (University 
of Washington) and ed-ready, ed-ext and gc-zip-2 (Hyseq, Inc.). The full-length nucleotide 
and amino acid sequences, including splice variants resulting from these procedures are 
shown in the Sequence Listing as SEQ ID NOS: 1- 35. 

10 Table 1 shows the various tissue sources of SEQ ED NO: 1-35. 

The homology for SEQ ID NO: 1-35 were obtained by a BLASTP version 2.0al 
19MP-WashU search against Genpept release 118, using BLAST algorithm. The results 
showed homologues for SEQ ID NO: 1-35 from Genpept. The homologues with 
identifiable functions for SEQ ID NO: 1-35 are shown in Table 2 below. 

15 Using eMatrix software package (Stanford University, Stanford, CA) (Wu et al., 

J. Comp. Biol., Vol. 6 pp. 219-235 (1999) herein incorporated by reference), all the 
sequences were examined to determine whether they had identifiable signature regions. 
Table 3 shows the signature region found in the indicated polypeptide sequences, the 
description of the signature, the eMatrix p-value(s) and the position(s) of the signature 

20 within the polypeptide sequence. 

Using the pFam software program (Sonnhammer et al, Nucleic Acids Res., Vol. 
26(1) pp. 320-322 (1998) herein incorporated by reference) all the polypeptide sequences 
were examined for domains with homology to certain peptide domains. Table 4 shows 
the name of the domain found, the description, the p-value and the pFam score for the 

25 identified domain within the sequence. 

The nucleotide sequence within the sequences that codes for signal peptide 
sequences and their cleavage sites can be determine from using Neural Network SignalP 
VI. 1 program (from Center for Biological Sequence Analysis, The Technical University 
of Denmark). The process for identifying prokaryotic and eukaryotic signal peptides and 

30 their cleavage sites are also disclosed by Henrik Nielson, Jacob Engelbrecht, Soren 
Brunak, and Gunnar von Heijne in the publication " Identification of prokaryotic and 
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eukaryotic signal peptides and prediction of their cleavage sites" Protein Engineering, 
Vol. 10, no. 1, pp. 1-6 (1997), incorporated herein by reference. A maximum S score and 
a mean S score, as described in the Nielson et as reference, was obtained for the 
polypeptide sequences. Table 5 shows the position of the signal peptide in each of the 
polypeptides and the maximum score and mean score associated with that signal peptide. 
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TABLE 1 



TISSUE ORIGIN 


RNA SOURCE 


HYSEQ 

T TDT1 An \T 
JXIAlYLCi 


bby ID JMUo : 


adult brain 


GIBC0 


AB3001 


1 5 11 19 23-24 35 


adult brain 


GIBCO 


ABD0 03 


J O i-X X3 Xo ZU ZO ZO J'i 

^ R 
j D 


adult brain 




Clontech 


iiCKU u x 


R - 7 11 1 Q 

D _ / XX X .7 


adult brain 


Clontech 


a tap nn^ 


£ 10 19 1 R 1ft 90 27 

O X U 1Z XJ io z u ^ / 


adult brain 


Clontech 


ABR00 8 


1-4 6-11 14 16 18-19 21-22 

9 £ - 9 fi ^0-^1 1 R 
z o Z O J U J X -J -J 


adult brain 


— i 1 — 

BioChain 


i-Uj-KU XZ 


in "3 r 

X U J j 


adult brain 


Invitrogen 


7\T3"D 111 A 


i n 
x u 


adult brain 


Invitrogen 


ABT004 


2 6 16 21-22 24 


cultured 
preadipocytes 


Strategene 


ADPO 01 


1 A O /I O /C "3-1 *3 O 

10 24 ZD il iJ 


adrenal gland 


Clontech 




D I XX Z ft Z O 


adult heart 


GIBCO 


AHRUU 1 


i "2 R q 1 n_i i i /i _i c OR "3 R 
1 J D _ 0 1U XX X*± XO ZD jj JJ 


adult kidney- 


GIBCO 


7\ V"Pi A A 1 

AivUU U 1 


"5 R « 11 91-99 94. 9 R 

,5 j O XX 1j XD ZX ZZ ^ *± ZiO 

J U J J 


adult kidney 


Invitrogen , 


AKT002 


8 10 12 15 18 21-22 26 28 


adult lung 


GIBCO 


7\ t pnm 


in i *3 i c 
_LU jj jj 


young liver 


GIBCO 


AL V U U 1 


O 1 A 1 7 91-99 9 A 9 
Z X*t X/ ZX ZZ Z*± ZD 


adult liver 


Invitrogen 


7\ T T 7 A A O 

ALVO 0 z 


in 1Q "> /I OQ 
1 / - 1 o Z4 ZO 


adult liver 


Clontech 


ALVO 0 3 


17 


adult ovary 


Invitrogen 


AOV0 0 1 


1-2 5-6 o lU-il ±4 lb Xo-ZZ 
oa o a oq on it "ic 

Z4 ZD ZO JU JJ 


adult placenta 


Clontech 


7\ "dt nm 
AJr LiU U X 


o 


placenta 


Invitrogen 


7\ TIT A A O 


in OA 1 R 


adult spleen 


GIBCO 


A CD A A 1 


O 7 1 D-1 1 1 P 9A-9R 9 R *2 7 

x, / XU XX XO ^J ^-i vJ JJ 


testis 


GIBCO 


ATS001 


21-22 26 28 30 


aciuiu DiauQer 


iiiviLi uycii 


J-j J-JJ-^ w w -L. 


10 


bone marrow 


Clontech 


BMD001 


1 11 21-22 31 33 35 


bone marrow 


f~* ~\ ✓-v>-i 4— /-i l-i 

Lioiitecn 




1 10-11 14 25-26 28 


adult colon 


Invitrogen 


CLN001 


5 11 16 


Mixture of 16 
tissues - mRMAs 


Various Vendors* 


CTliO 16 


X 


Mixture of 16 
tissues - mRNAs* 


Various Vendors* 


CTL021 


17 


Mixture of 16 
tissues - mRNAs* 


T 7a >~ "i nn a VpnHnyQ* 
Val lUUo v C11U.UJ. c? 


CTLtO 2 8 

\w X -L-I \J \J 


10 


adult cervix 


BioChain 


CVX001 


1-2 10-11 14 18 28 33 35 


endothelial cells 


Strategene 


EDT001 


6 8 10-11 21-22 24 26 33 35 


fetal brain 


Clontech 


FBR004 


2 20 26 30 


fetal brain 


Clontech 


FBR0 0 6 


2 6 8-9 11 16 18 21-22 27 30 



* The 16 tissue-mRNAs and their vendor source, are as follows: 1) Normal adult brain mRNA (Invitrogen), 2) 
normal adult kidney mRNA (Invitrogen), 3) normal adult liver mRNA (Invitrogen), 4) normal fetal brain mRNA 
(Invitrogen), 5) normal fetal kidney mRNA (Invitrogen), 6) normal fetal liver mRNA (Invitrogen), 7) normal fetal 
skin mRNA (Invitrogen), 8) human adrenal gland mRNA (Clontech), 9) human bone marrow mRNA (Clontech), 10) 
human leukemia lymphablastic mRNA (Clontech), 11) human thymus mRNA (Clontech), 12) human lymph node 
mRNA (Clontech), 13) human spinal cord mRNA (Clontech), 14) human thyroid mRNA (Clontech), 15) human 
esophagus mRNA (BioChain), 16) human conceptional umbilical cord mRNA (BioChain). 
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TABLE 1 



TISSUE ORIGIN 




HYSEQ 
LIBRARY 
NAME 


SEQ ID NOS: 








35 


fetal brain 


Clontech 


FBRS03 


21-22 


fetal brain 


xii v x l. x uycu 


FBT002 


2 8-10 18-19 24 26 30 


fetal kidney 


Clontech 


FKD001 


7 28 


fetal lung 


\^ 1 OllU C Oil 


FLG0 01 


10 


fetal lung 


Invitrogen 


FLG003 


10 24 26 


fetal liver-spleen 


Columbia 
university 


r jjOvui 


2 6-8 10-14 17 21-22 24 26 
28 31 34-35 


fetal liver-spleen 


LOlUTuDia 
uii-i vci biuy 


FT. SO 02 


3 5-6 8 11 13 16 18 21-22 
24-26 28 30 32-35 


fetal liver-spleen 


0 1 unuj J- ci 

University 


FLS0 03 


10 17-18 26 


fetal liver 


inviuruycii 


FLV001 


14 17 24 33 35 


fetal liver 


LlOIiLcLIl 


FLV004 


10 14 28 


fetal muscle 


X 11 V X UX Gil 


FMS0 01 


3 6 10 26 31 


fetal muscle 


xnv x li uy en 


FMS0 02 


10 25 30 


fetal skin 


TriTri 4~ yArton 

mviui uycii 


FSK001 


3 5-6 10 16-17 21-23 25-26 
28 30-31 


fetal skin 


iiiviuioycii 


FSK0 02 

X X\. W V 4-J 


10 18 26 


fetal spleen 


BioChain 


FSP001 


10 


umbilical cord 


BioChain 


PTTPO n 1 
r uuuui 


3 10 20 24 26 30 33 


fetal brain 


GIBCO 


HFB001 


3-6 8 10-11 15 18 21-24 35 


infant brain 


Columbia 
University 


TT30 n O 0 
Ll5Z U U Z 


0 A 7-8 10 14 16 18-19 21-23 
26 28 35 


infant brain 


Columbia 
University 


1.DZ UU j 


3 5 8-10 16 20 23 26 29 35 


infant brain 


Columbia 
univei o x uy 




19 


infant brain 


Columbia 

TTn "i trove i \~\r 


IBS001 


26 35 


lung, fibroblast 


Strategene 


LFB001 


10 21-22 30 33 


lung tumor 


Invitrogen 


Xjo 


2 10-11 14-15 24 28 30 35 


lymphocytes 


ATCC 


LPC001 


1 3 11 18 28 35 


leukocyte 


GIBCO 


LiUUU U 1 


1 _7 c;_7 ii 16 1 8 21-22 24-26 

X «J J / XX _LVJ _LlJ ^jJ. ^ - 1 - ^ " 

2 8 35 


melanoma from cell 

-i • _ Tintnn Ji/-inT T A O A 

line ATCC #CRIi 142 4 




Clontech 


i v I.Cj DU w*± 


2 24 26 


mammary gland 


HIV X LX vjy Cil 


MMG001 


2 6 10 12 14 16 18-19 24 26 
28 31 35 




induced neuron cells 


burauegene 


MTD 0 01 

IN 1JJ v U i 


15 23 


neuronal cells 


O 4— -v~ -a •(— QAOVi O 

b uraLcyciic 


NTUO 01 


10 26 


placenta 


Liontecn 


PTiAO 0^ 


33 


prostate 


Lionnecn 


Ir PL 1 U u 1 


7-8 10-11 14 16 21-22 24 


rectum 


invitrogen 




6 26 28 


salivary gland 


L.IUI1L.CLI1 


SAL0 01 


10 16 21-22 35 


sKin riDroDiasu 


ATPp 
H1UL 


SFB001 


10 


small intestine 


Clontech 


SIN001 


3 5 10 14 24 30 


skeletal muscle 


Clontech 


SKM001 


10 15 


spinal cord 


Clontech 


SPC001 


18 26 28 


adult spleen 


Clontech 


SPLcOl 


1 14 26 


stomach 


Clontech 


ST0001 


16 


thalamus 


Clontech 


THA002 


2-3 24 26 
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TABLE 1 



TISSUE ORIGIN 




HYSEQ 
T.TRR ARY 
NAME 


SEQ ID NOS: 


thymus 


Clontech 


THMC02 


3 6-7 14 17 26 31-32 35 


thyroid gland 


Clontech 


THRO 01 


3 6 8 10-11 17-18 21-22 24 
26 28 33 


trachea 


Clontech 


TRC001 


10 21-22 33 


uterus 


Clontech 


UTR001 


10-11 24 



o 
-fl 
m 

m 
m 

UJ 

m 
a 

a 

Q 
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TABLE 2 



SEQ ID 
NO: 


CORRESPONDING 
SEQ ID NO , liN 
U.S. S.N 

09/5 74, 4b4 


ACCESSION 


DESCRIPTION 


SMITH- 
WATERMAN 
SCORE 


% IDENTITY 


1 


1621 


AL049785 


Homo sapiens 
hypothetical 
protein 


761 


100 


2 


1724 


U20657 


Homo sapiens 
ubiquitin protease 


719 


47 


3 


2106 


Y14494 


Homo sapiens 
aralarl 


3462 


99 


4 


2500 


U00051 


Caenorhabditis 
elegans coded for 
Ktt n p>lpaanFi cDNA 
yk50b2.5; coded for 
by C. elegans cDNA 
CEESV26F; similar 
i-o l inaqps over a 
short region 


414 


33 


5 


2501 


ADZ D / D 


D*rosonhi la 
melanogaster 

nhi rrn "l 1" 1 n - 

coni ugating enzyme 


78 


26 


6 


2507 


u ± / u v o 


Homo saoiens 
PMS2L14 


1552 


99 


7 


2520 


AF222766 


Bos taurus ankyrin 
1 


835 


31 


8 


2555 


i-l_D U J -? D / V 


Homo sapiens ALEX1 


2332 


100 


9 


3766 


U51000 


Mus musculus DLX-1 


1319 


98 


10 


3935 


X66405 


Mus musculus 
collagen alphal 
type Vl-precursor 


5011 


89 


11 


4297 


AF2 08o5o 


Womo d3ni pns BM-014 


1012 


99 


12 


4333 


X62677 


cuniculus 

retrovirus related 
transcriptase 


75 


52 


13 


4449 


M14912 


Homo sapiens pol 


132 


86 


14 


4562 


AB024028 


Arabidopsis 
thaliana uridine 
kinase-like protein 


1021 


45 


15 


4591 


L11275 


Saccharomyces 
cerevisiae selected 
as a weak 
suppressor of a 
mutant of the 

cnhn-n-it ACA 0 of DNA 
dependant RNA 
polymerase I and 
III 


97 


19 


16 


4614 


AL110500 


Caenorhabdi t i s 
elegans Y87G2A.13 


260 


24 


17 


4711 


A14 82 9 


Homo sapiens 

preproapolipoprotei 

n 


974 


100 
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TABLE 2 



SEQ ID 
NO: 


CORRESPONDING 
SEQ ID NO. IN 
U.S. S.N 
09/574,454 


ACCESSION 
NUMBER 


DESCRIPTION 


SMITH- 
WATERMAN 
SCORE 


% IDENTITY 


18 


4726 


AF151086 


Homo sapiens 
HSPC252 


1223 


81 


19 


4737 


AF083116 


Homo sapiens 
paraneoplastic 
cancer- test is -brain 
antigen 


122 


42 


20 


4762 


AC006963 


Homo sapiens 
similar to Kelch 
proteins; similar 
to BAA77027 
(PID:g4650844) 


1226 


43 


21 


4790 


L39995 


Ceratovacuna 
lanigera cytochrome 
oxidase I 


76 


28 


22 


4790 


L39995 


Ceratovacuna 
lanigera cytochrome 
oxidase I 


76 


28 


23 


4828 


AB035266 


Homo sapiens 
neurexin II 


9034 


100 


24 


4999 


M37194 


Rattus norvegicus 
clathrin-associated 
protein 17 


704 


91 


25 


5155 


AF217516 


Homo sapiens 
uncharacterised 
bone marrow protein 
BM040 


669 


100 


26 


5244 


U35776 


Rattus norvegicus 
ADP-ribosylation 
factor 1-directed 
GTPase activating 
protein 


1849 


84 


27 


5280 


AF159852 


Drosophila 
melanogaster RNA- 
binding protein 
Smaug 


259 


44 


28 


5502 


Y13247 


Homo sapiens FBI 9 
protein 


5157 


99 


29 


5813 


AF116638 


Homo sapiens 
PR01546 


70 


48 


30 


5868 


L21013 


Dictyostelium 
discoideum RabC 


12 6 


25 


31 


5890 


AL031033 


Homo sapiens 
C321D2.4 (novel 
protein) 


960 


100 


32 


6070 


X58236 


Homo sapiens 3 6/8-8 
fusion protein with 
epitope for anti- 
lectin antibody 


90 


73 


33 


6245 


AF095593 


Homo sapiens 
caveolin-1 


946 


100 


34 


6277 


AF13 3 521 


Libellula pulchella 
troponin T 


45 


43 
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TABLE 2 



SEQ ID 
NO: 


CORRESPONDING 
SEQ ID NO. IN 
U.S. S.N 
09/574, 454 


ACCESSION 
NUMBER 


DESCRIPTION 


SMITH - 
WATERMAN 
SCORE 


% IDENTITY 


35 


6298 


AF161491 


Homo sapiens 
HSPC142 


1379 


100 
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TABLE 3 



ACCESSION 
NO. 



DM01117 



BL00972 



DESCRIPTION 



2 kw TRANSPOSASE 
WITHIN TRANSPOSITION 
VASOTOCIN. 



Ubiquitin carboxyl- 
terminal hydrolases 
family 2 proteins. 



RESULTS* 



DM01117A 11.17 9.173e-06 52-70 



BL00972A 11.93 5.091e-19 498-516 

BL00972D 22.55 4.000e-15 1148-1173 

BL00972C 16.48 9.143e-12 642-657 

BL00972E 20.72 4.462e-ll 1182-1204 

BL00972B 9.45 4.176e-09 584-594 



BL00215 



Mitochondrial energy 
transfer proteins. 



BL00215A 15.82 1.600e-15 333-358 

BL00215A 15.82 1.794e-14 433-458 

BL00215A 15.82 4.441e-14 525-550 

BL00215B 10.44 3.250e-10 569-582 

BL00215B 10.44 1.000e-08 381-394 



PF00756 



Putative esterase. 



PF00756C 14.12 2.084e-09 206-236 



PR00701 



BL00058 



PF00023 



DM00892 



PR00031 



60KD INNER MEMBRANE 
PROTEIN SIGNATURE 



PR00701I 8.59 6.256e-06 19-43 



DNA mismatch repair 
proteins mutL / hexB 
PMS1 proteins. 



BL00058A 
BL00058B 
BL00058C 
BL00058C 



20 .73 
16 . 83 
18.05 
18.05 



5 .125e 
3 .721e 
7 .568e 
8 .235e 



-33 108-159 

-29 172-211 

-16 240-258 

-13 36-54 



Ank repeat proteins. 



PF00023A 
PF00023A 
PF00023A 
PF00023B 
PF00023B 
PF00023B 
PF00023B 
PF00023A 
PF00023A 
PF00023B 
PF00023B 
PF00023B 
PF00023A 
PF00023B 
PF00023A 



16. 


03 


7. 


600e- 


13 


177 


-193 


16 


03 


8 . 


286e- 


11 


144 


-160 


16 


03 


2 . 


500e- 


10 


760 


-776 


14 


20 


5 


OOOe- 


10 


339 


-349 


14 


20 


5 


500e- 


10 


239 


-249 


14 


20 


6 


500e- 


10 


471 


-481 


14 


20 


8 


OOOe- 


10 


823 


-833 


16 


.03 


8 


875e- 


10 


554 


-570 


16 


.03 


1 


OOOe- 


09 


45- 


61 


14 


.20 


2 


.227e- 


09 


438 


-448 


14 


.20 


2 


.636e- 


09 


272 


-282 


14 


.20 


2 


.636e- 


09 


583 


-593 


16 


. 03 


5 


. 821e- 


09 


587 


-603 


14 


.20 


7 


. 955e- 


09 


405 


-415 


16 


.03 


9 


.679e- 


09 


210 


-226 



3 RETROVIRAL 
PROTEINASE. 



DM00892B 9.78 4.767e-06 27-33 



LAMBDA AND OTHER 
REPRESSOR HELIX-TURN- 
HELIX SIGNATURE 



PR00031B 16.29 1.643e-17 166-183 
PR00031A 8.77 9.400e-09 157-167 



PR00159 



2FE-2S FERREDOXIN 
SIGNATURE 



PR00159B 8.50 7.882e-09 604-612 



PR00138 



MATRIX IN SIGNATURE 



PR00138D 16.56 4.517e-09 77-103 



BL00597 



Plant lipid transfer 
proteins . 



BL00597B 12.41 7.955e-06 11-40 



BL00366 



PF00602 



DM01688 



Uricase proteins. 



Influenza RNA- 
dependant RNA 
polymerase subunit 
PB1. 



BL00366A 11.55 9.780e-06 25-39 
PF00602E 11.62 5.160e-06 95-142 



I 2 POLY-IG RECEPTOR. 



DM01688K 17.19 5.938e-06 321-360 



* Results include in order: accession number 
position of signature in amino acid sequence 



subtype; raw score; p-value; 
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TABLE 3 



SEQ ID 
NO: 


ALUn.bbxU.IN 
MO. 


i_J J_l kj V_ -LV J_ J- -L 


RESULTS* 


17 


PD02 8 07 


±\.rr \J XjX it v^ir.£\.W J. HiJL±m Hi 

PRECURSOR APO-E 
GLYCOPROTEIN PLAS . 


PD02807D 7.99 5.534e-09 105-155 


18 


DM003 03 


O i_i.Ci.fi 11 rlilK rLEiir Hi£\l. 

REPEAT . 


DM00303B 21.87 9.173e-07 217-252 


19 


BL00048 


ir J_ OUciUilIlc IrX pLULCiiio . 


BL00048 6.39 8.763e-09 106-133 
BL00048 6.39 9.663e-09 108-135 


20 


PF00 651 


J_5 X -LJ \ dlbU JVl-l 1 ^ W1X CLO IJX\- 

C/Ttk) domain 

piULclllb . 


PF00651 15.00 9.182e-15 76-89 


21 


Tin A A T A "1 


r> y ,J l\ 1 J X IN In Hi .In. i T i__ii"iJ_)x\_r_j.iJ j-j 

PROTEIN SIGNATURE 


PR00701B 15.26 7.188e-07 177-199 


22 


PRO 0701 


^ n "K"n TKrMT?P MPMRP AMR 

O \J f\ 1 ' J_ IN ± <J H I\. l*ini*10£\_tt-L\l J-J 

PROTEIN SIGNATURE 


PR00701B 15.26 7.188e-07 177-199 


23 


DM00060 


338 kw NEUREXIN ALPHA 

TTT PYQTPTNF 1 

XXX U X O J. Hi X1M Hi . 


DM00060 6.92 3.925e-10 210-220 


24 


PR00317 


EPENDYMIN SIGNATURE 


PR00317F 10.90 5.935e-09 117-132 


25 


PR00305 


SIGNATURE 


PR00305F 15.95 7.150e-06 226-256 


26 


PR00405 


tttt t ■n'CTr T"I\T r "PT?"D A r*T TT\TfZ 
HIV RniV XJM iHiKrlL, 1 XINVj 

PROTEIN SIGNATURE 


PR00405A 17.71 2.286e-20 19-39 
PR00405B 11.83 6.077e-15 38-56 
PR00405C 19.41 4.000e-13 59-81 


28 


PRO 03 34 


HMW KIJM IJNJUUliiJN 
SIGNATURE 


PR00334B 8.69 9.914e-09 867-891 


29 


PRO 05 51 


i f~i riT rMDTTT TTvT T7AMTT . V 

SIGNATURE 


PR00551E 10 27 6.211e-06 12-27 


30 


PR00300 


ATP -DEPENDENT CLP 

PKU lJiAbii ±\L±r -J_SXrJiJXiMvj 

SUBUNIT SIGNATURE 


PR00300A 9.56 7.896e-09 6-25 


31 


PR00467 


l*J^_i v ll v lr-Xj X _-UN J_l X XT WjT- X O-Cii-M J-j 

SIGNATURE 


PR00467E 9.00 1.000e-05 27-47 


33 


BL01210 


Caveolins proteins. 


BL01210B 13.92 1.000e-40 91-141 
BL01210A 17.61 6.000e-39 43-82 
BL01210C 17.86 8.579e-34 141-173 


34 


PR00658 


CD44 ANTIGEN PRECURSOR 
SIGNATURE 


PR00658D 8.52 9.919e-06 12-32 


35 


PR00512 


5 -HYDROXYTRYPTAMINE 1A 
RECEPTOR SIGNATURE 


PR00512E 10.80 4.884e-06 30-47 
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TABLE 4 



SEQ ID 
NO : 


P r i\rl IN HI V 1 Ji 


DESCRIPTION 


p-value 


pFAM SCORE 


2 




MVKTD fincier 


1 . le-06 


35.6 


3 


efhand 


EF hand 


0 . 0016 


25.1 


6 


DNA mis repair 


FiMS mi Qm^t~r*ll TPDSIT 

J_J.Ly.f-i. ILIi. DLLlCll'Ull -1- J-^ a. J- -L. 

nr*nt pin 


8 . 2e-36 


132,4 


7 


ank 


Ank repeat 


3.6e-197 


668 .4 


9 


homeoloox 


Wnmpnhny domain 


1.5e-29 


111.6 


10 


Collagen 


fnl 1 ^cfpn trinle helix 
rpnpaf (20 cooies) 


7 . 5e-46 


165.8 


14 


PRJ1 


PhnQnhnri Trvulokinase 


9 .5e-06 


-24.1 


17 


Apolipoprotein 


Apolipoprotein A1/A4/E 

f aiYii "1 \r 


5 .2e-113 


388. 8 


18 


BTB 


BTB/POZ domain 


2 . 6e-25 


97.5 


20 


Kelch 


Kelch motif 


1.7e-22 


88 .2 


23 


EGF 




3 . 5e-09 


43 .9 


24 


Clat__adaptor_s 


Clathrin adaptor 
complex smaj.i (__na.j-±i 


5 .7e-88 


305.7 


26 


Arf Gap 


Putative bir-ase 
activating protein for 
Arf 


6 . 3e-62 


219.2 


27 


SAM 


SAM domain (Sterile 
alpha motif) 


0.003 


20.7 


30 


ras 


Ras family 


0.0058 


-103 .2 


33 


Caveolin 


Caveolin 


3 ,7e-120 


412.6 
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TABLE 5 



SEQ ID NO: 


D^CTTTmvT OP QTfTWATi 
rUol 1 IviN ur oiuiiinij 

IN AMINO ACID 

SEQUENCE 


maxS {MAXIMUM SCORE) 


meanS (MEAN SCORE) 


10 


1-19 


0.968 


0.899 


12 


1-23 


0.960 


0.861 


16 


1-28 


0 .983 


0 .763 


17 


1-18 


0 .966 


0.921 


23 


1-28 


0 .963 


0. 881 


29 


1-20 


0 . 940 


0.677 



ill 
<s 

if! 
ill 
!J! 

Ui 

Ui 
ill 



□ 
Q 
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CLAIMS 

WHAT IS CLAIMED IS: 

1. An isolated polynucleotide comprising a nucleotide sequence selected from 
the group consisting of SEQ ID NO: 1 - 35, a mature protein coding portion of SEQ 
ID NO: 1 - 35, an active domain of SEQ ID NO: 1 - 35, and complementary 
sequences thereof. 

2. An isolated polynucleotide encoding a polypeptide with biological activity, 
wherein said polynucleotide hybridizes to the polynucleotide of claim 1 under 
stringent hybridization conditions. 

3. An isolated polynucleotide encoding a polypeptide with biological activity, 
wherein said polynucleotide has greater than about 90% sequence identity with the 
polynucleotide of claim 1. 

4. The polynucleotide of claim 1 wherein said polynucleotide is DNA. 

5. An isolated polynucleotide of claim 1 wherein said polynucleotide comprises 
the complementary sequences. 

6. A vector comprising the polynucleotide of claim 1 . 

7 . An expression vector comprising the polynucleotide of claim 1 . 

8 . A host cell genetically engineered to comprise the polynucleotide of claim 1 . 

9. A host cell genetically engineered to comprise the polynucleotide of claim 1 
operatively associated with a regulatory sequence that modulates expression of the 
polynucleotide in the host cell. 

10. An isolated polypeptide, wherein the polypeptide is selected from the group 
consisting of: 

(a) a polypeptide encoded by any one of the polynucleotides of claim 1 ; and 
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(b) a polypeptide encoded by a polynucleotide hybridizing under stringent 
conditions with any one of SEQ ID NO: 1-35. 



11. A composition comprising the polypeptide of claim 1 0 and a carrier, 

5 

12. An antibody directed against the polypeptide of claim 1 0. 

13. A method for detecting the polynucleotide of claim 1 in a sample, comprising: 

a) contacting the sample with a compound that binds to and forms 
10 a complex with the polynucleotide of claim 1 for a period sufficient to form the 

complex; and 

b) detecting the complex, so that if a complex is detected, the 
polynucleotide of claim 1 is detected. 

15 14. A method for detecting the polynucleotide of claim 1 in a sample, comprising: 

a) contacting the sample under stringent hybridization conditions 
with nucleic acid primers that anneal to the polynucleotide of claim 1 under such 
conditions; 

b) amplifying a product comprising at least a portion of the 
20 polynucleotide of claim 1; and 

c) detecting said product and thereby the polynucleotide of claim 
1 in the sample. 

15. The method of claim 14, wherein the polynucleotide is an RNA molecule and 
25 the method further comprises reverse transcribing an annealed RNA molecule into a 

cDNA polynucleotide. 

16. A method for detecting the polypeptide of claim 1 0 in a sample, comprising: 

a) contacting the sample with a compound that binds to and forms 
30 a complex with the polypeptide under conditions and for a period sufficient to form 

the complex; and 

b) detecting formation of the complex, so that if a complex 
formation is detected, the polypeptide of claim 10 is detected. 
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17. A method for identifying a compound that binds to the polypeptide of claim 
10, comprising: 

a) contacting the compound with the polypeptide of claim 10 
under conditions sufficient to form apolypeptide/compound complex; and 
5 b) detecting the complex, so that if the polypeptide/compound 

complex is detected, a compound that binds to the polypeptide of claim 10 is 
identified. 

18. A method for identifying a compound that binds to the polypeptide of claim 

10 10, comprising: 

a) contacting the compound with the polypeptide of claim 10, in a 
cell, under conditions sufficient to form a polypeptide/compound complex, wherein 
the complex drives expression of a reporter gene sequence in the cell; and 

b) detecting the complex by detecting reporter gene sequence 
15 expression, so that if the polypeptide/compound complex is detected, a compound 

that binds to the polypeptide of claim 10 is identified. 

19. A method of producing the polypeptide of claim 10, comprising, 

a) culturing a host cell comprising a polynucleotide sequence 
20 selected from the group consisting of a polynucleotide sequence of SEQ ID NO: 1-35, 
a mature protein coding portion of SEQ ID NO: 1-35, an active domain of SEQ ID 
NO: 1-35, complementary sequences thereof and a polynucleotide sequence 
hybridizing under stringent conditions to SEQ ID NO: 1-35, under conditions 
sufficient to express the polypeptide in said cell; and 
25 b) isolating the polypeptide from the cell culture or cells of step 

(a). 

20. An isolated polypeptide comprising an amino acid sequence selected from the 
group consisting of any one of the polypeptides from the Sequence Listing, the 

30 mature protein portion thereof, or the active domain thereof. 

21. The polypeptide of claim 20 wherein the polypeptide is provided on a 
polypeptide array. 
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22. A collection of polynucleotides, wherein the collection comprising the 
sequence information of at least one of SEQ ID NO: 1-35. 

23. The collection of claim 22, wherein the collection is provided on a nucleic 
5 acid array. 

24. The collection of claim 23, wherein the array detects full-matches to any one 
of the polynucleotides in the collection. 

10 25. The collection of claim 23, wherein the array detects mismatches to any one 
of the polynucleotides in the collection. 

26. The collection of claim 22, wherein the collection is provided in a computer- 
readable format. 

15 

27. A method of treatment comprising administering to a mammalian subject in 
need thereof a therapeutic amount of a composition comprising a polypeptide of 
claim 10 or 20 and a pharmaceutical^ acceptable carrier. 

20 28. A method of treatment comprising administering to a mammalian subject in 
need thereof a therapeutic amount of a composition comprising an antibody that 
specifically binds to a polypeptide of claim 10 or 20 and a pharmaceutical^ 
acceptable carrier. 
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ABSTRACT OF THE INVENTION 

The present invention provides novel nucleic acids, novel polypeptide 
sequences encoded by these nucleic acids and uses thereof. 
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<110> Tang, Y. Tom 
Liu, Chenghua 
Zhou, Ping 
Asundi , Vinod 
Ren, Feiyan 
Zhao, Qing A. 
Zhang, Jie 
Wang, Jian-Rui 
Wehrman , Tom 
Drmanac, Radoje T. 



<120> Novel Nucleic Acids and 
Polypeptides 



<130> 789CIP2C 

<140> To Be Assigned 
<141> 

<150> 09/574,454 
<151> 2000-05-19 

<150> 09/519,705 
<151> 2000-03-07 



<160> 35 



<170> pt_FL_genes Version 2.0 



<210> 1 

<211> 2235 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> CDS 

<222> (258) . . (671) 

<400> 1 

taagcttgcg gccgccccgg ccgcccccgc gggggacacc tcctcgccgc cacagcttta 
ggaaacacct ctacctcctg cgaggcctcc cgggctccgg gaaaactaca ctggccagac 
aattgcagca tgactttccc agggccctga ttttcagcac ggatgatttt ttcttcaggg 



1 



aagatggtgc ctatgagttc aatcctgact tcctggagga agctcatgaa tggaaccaaa 240 

aaagagcaag aaaagca atg agg aat ggc ata tec ccc att att att gat 2 90 

Met Arg Asn Gly lie Ser Pro lie lie lie Asp 
15 10 



338 



aat acc aac etc cac gec tgg gaa atg aag ccc tat gca gtc atg gca 
Asn Thr Asn Leu His Ala Trp Glu Met Lys Pro Tyr Ala Val Met Ala 
15 20 25 

ctt gaa aat aac tat gaa gtt ata ttc cga gaa cct gac act cgc tgg 
Leu Glu Asn Asn Tyr Glu Val He Phe Arg Glu Pro Asp Thr Arg Trp 
30 35 40 

aaa ttc aac gtt caa gag tta gca aga aga aac att cat ggt gtc tea 
Lys Phe Asn Val Gin Glu Leu Ala Arg Arg Asn He His Gly Val Ser 
45 50 55 

aga gaa aaa ate cac cga atg aaa gaa egg tat gaa cac gat gtt act 
Arg Glu Lys He His Arg Met Lys Glu Arg Tyr Glu His Asp Val Thr 
60 65 70 75 

ttt cac agt gtg ctt cat gca gaa aag cca age aga atg aac aga aac 
Phe His Ser Val Leu His Ala Glu Lys Pro Ser Arg Met Asn Arg Asn 
80 85 90 

cag gac agg aat aat gca ttg cct tec aac aat gec aga tac tgg aat 
Gin Asp Arg Asn Asn Ala Leu Pro Ser Asn Asn Ala Arg Tyr Trp Asn 
95 100 105 

tec tac aca gag ttt cca aac egg agg gec cac ggt gga ttt aca aat 
Ser Tyr Thr Glu Phe Pro Asn Arg Arg Ala His Gly Gly Phe Thr Asn 
110 115 120 

gag age tec tat cac aga agg ggc ggt tgt cac cat gga tat tag agg 
Glu Ser Ser Tyr His Arg Arg Gly Gly Cys His His Gly Tyr * 
125 130 135 

cctatcttac agecaggcag aattttccta agtcagtttc tacttcagtt tttgttattt 734 

tttgttgcat tttagtcaga gctccaattc cagtgtaaat agctgaactc aaaagtttct 7 94 

gagcaaagtc attatattca ctttcttcac caaaatttgt taaagtgctt etatatgeat 854 

ggtctgatgc tgggaattct gcagatttga gtaaacagtc tctttctcta gggtaagaat 914 

ttgaaaccaa aacttgagaa cacacccaag aatatattta cataggttca tagatgaaat 974 

aaagtgttta tattatatat aagcttcagt accatttget ctgaagtgat ctatttattt 1034 

tttcaggaaa ttcatctcca teggtaaagt tgggaaggtg gagagaagtg gtgggggggc 

attgetaett atcaaagtgc cattgetact ttgataatct atgtatctaa aaatgtgaga 

tgtgcgactc ttatgatact gattttcctt taatgttaat atgccagaaa gcatacatct 



386 



434 



482 



530 



578 



626 



674 



1094 
1154 
1214 
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aagggaacat 


tgtccttcaa 


agtagacact 


ttgggaagtt 


atttctttat 


tttaatgatg 


1274 


tatcattgtt 


aaaaatgctg 


tcaaatcctt 


aatagctaca 


ggagctactg 


agggaaatca 


1334 


gtgtcattat 


ttaaagtcac 


gccttgtgtt 


tttactactt 


tattcagcag 


gattaaacct 


1394 


gaataacttt 


tggctgttgt 


gctaatagtg 


taaataaaat 


aagcctgcct 


tcataaaaca 


1454 


ctaactttta 


aaaggaataa 


acgacttcta 


aaattatgcc 


tattaacatg 


tgtaattagt 


1514 


cggcagctca 


aatgtttggg 


agtgcaagaa 


attaggcacc 


ccaggatata 


ggtcatacag 


1574 


ggatatataa 


aagccatgct 


cattacaaaa 


tgagcagttg 


atgttttatg 


tggcattaag 


1634 


acaatcaagt 


cctcacaact 


ctggaatgtc 


ttcttatact 


gatgctgaat 


ttatgaatcc 


1694 


aaattaattt 


ccaacaggtt 


ggaatcagat 


ttaatgtgag 


atcatgatag 


acaagaccac 


1754 


agaggacgta 


tgctctattt 


cttgttggcc 


aacagcttct 


ttctaatgtt 


ctgtgaaaaa 


1814 


ttattttaag 


tgtcttatat 


aatggtgctt 


ttatggttat 


taaaaattgt 


aaatggtatc 


1874 


acatttatat 


ggatttgtca 


ttggatcttt 


ttttggttca 


acaataaaaa 


aatttaatta 


1934 


cctaaatgcc 


aagaaactca 


acaatatacc 


agtttttctg 


tatcacaggc 


ttatttacca 


1994 


gtcttttttt 


taataaatag 


gaatcgtaaa 


ggtaatgaca 


aaagcagcct 


tataatttag 


2054 


ttgcttatat 


atttgatctg 


tgtacatgag 


actgttttaa 


cgttatctga 


cactactgaa 


2114 


acctgctcga 


catctccatg 


actaccaaca 


ccatgtgtaa 


tgttttcttc 


actaacattt 


2174 


taaaaactgg 


tatctccttt 


gagtaagttt 


ggctgacaat 


agtaaatccc 


aatgaatcta 


2234 


a 












2235 


<210> 
<211> 
<212> 
<213> 


2 

4600 
DNA 

Homo sapiens 










<220> 
<221> 
<222> 


CDS 

(349) . . (4305) 










<400> 
ttcaaggtca 


2 

cgggccgtgc 


cagtccctac 


ccctagtgcc 


ccgcagcgtg 


ccagcccact 


60 


gatgccggca 


ggccgttacg 


ctatgcctgt 


ggcagcctcc 


cacctgggag 


aggactggcg 


120 


gccacgaggg 


caacttcgtg ggagaggtgg 


tggcgggctc 


ggggttcaga 


ggctgaagct 


180 


gggtcgcagc 


tcgaccgggg 


cgctgatctg 


tggctgtccg 


ttccttgctg 


gagaatttgg 


240 



ccacaaagag ctgccaagat agctgggcca ggaagaaagc gccgcagccc tgacccagac 3 00 

gctgttgccg accccggggc actctggctg tcgaccaagc ggctcaag atg tct ggc 357 

Met Ser Gly 
1 

999 9cc agt gcc aca ggc cca agg aga ggg ccc cca gga ctg gag gac 4 05 

Gly Ala Ser Ala Thr Gly Pro Arg Arg Gly Pro Pro Gly Leu Glu Asp 
5 10 15 

acc act agt aag aag aag cag aag gat cga gca aac cag gag age aag 453 
Thr Thr Ser Lys Lys Lys Gin Lys Asp Arg Ala Asn Gin Glu Ser Lys 
20 25 30 35 

gat gga gat cct agg aaa gag aca ggg tct cga tat gtt gcc cag get 501 
Asp Gly Asp Pro Arg Lys Glu Thr Gly Ser Arg Tyr Val Ala Gin Ala 
40 45 50 

ggt ctt gaa cct ctg gcc tea ggt gat cct tct gcc tea gcc tec cat 549 
Gly Leu Glu Pro Leu Ala Ser Gly Asp Pro Ser Ala Ser Ala Ser His 
55 60 65 

gca get ggg ate aca ggc tea cgc cac cgt acc egg ctg ttc ttt cct 5 97 

Ala Ala Gly lie Thr Gly Ser Arg His Arg Thr Arg Leu Phe Phe Pro 
70 75 80 

tea teg tea ggg tea gca tec act cct caa gag gag cag acc aaa gag 645 
Ser Ser Ser Gly Ser Ala Ser Thr Pro Gin Glu Glu Gin Thr Lys Glu 
85 90 95 

gga get tgt gaa gac cct cat gat etc ttg get act ccc act cca gag 693 
Gly Ala Cys Glu Asp Pro His Asp Leu Leu Ala Thr Pro Thr Pro Glu 
100 105 110 115 

ttg ttg etc gat tgg agg cag agt gca gaa gag gtg att gtc aag ctt 741 
Leu Leu Leu Asp Trp Arg Gin Ser Ala Glu Glu Val lie Val Lys Leu 
120 125 130 

cgt gtg gga gta ggt ccc ctg cag ctg gag gat gta gat get get ttc 789 
Arg Val Gly Val Gly Pro Leu Gin Leu Glu Asp Val Asp Ala Ala Phe 
135 140 145 

aca gat aca gac tgt gtg gtg egg ttt gca ggt ggt cag cag tgg ggt 83 7 

Thr Asp Thr Asp Cys Val Val Arg Phe Ala Gly Gly Gin Gin Trp Gly 
150 155 160 

ggt gtc ttc tat get gag ata aaa age tct tgt get aaa gtg caa acc 8 85 

Gly Val Phe Tyr Ala Glu lie Lys Ser Ser Cys Ala Lys Val Gin Thr 
165 170 175 

cgc aag ggc agt etc ctg cac ctg aca ctg ccc aaa aag gtg cct atg 933 
Arg Lys Gly Ser Leu Leu His Leu Thr Leu Pro Lys Lys Val Pro Met 
180 185 190 195 

etc acg tgg ccc tec etc ctg gtt gag get gat gaa cag ctt tgc ata 981 
Leu Thr Trp Pro Ser Leu Leu Val Glu Ala Asp Glu Gin Leu Cys lie 
200 205 210 
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cca ccg ctg aac tec caa acc tgc etc ctg ggc tea gag gag aat tta 102 9 

Pro Pro Leu Asn Ser Gin Thr Cys Leu Leu Gly Ser Glu Glu Asn Leu 
215 220 225 

gec cct ttg gca gga gag aaa gca gtg cct ccc ggg aat gac cca gtc 1077 
Ala Pro Leu Ala Gly Glu Lys Ala Val Pro Pro Gly Asn Asp Pro Val 
230 235 240 

tct cca gee atg gtc egg age aga aac cct ggg aaa gat gac tgt gec 1125 
Ser Pro Ala Met Val Arg Ser Arg Asn Pro Gly Lys Asp Asp Cys Ala 
245 250 255 

aag gag gag atg gca gtg gca gca gat get gca acc ttg gtg gat gag 1173 
Lys Glu Glu Met Ala Val Ala Ala Asp Ala Ala Thr Leu Val Asp Glu 
260 265 270 275 

ccc gag teg atg gtg aac ctg gcg ttt gtc aag aat gac teg tat gag 1221 
Pro Glu Ser Met Val Asn Leu Ala Phe Val Lys Asn Asp Ser Tyr Glu 
280 285 290 

aag ggc ccg gat tea gtg gtg gtg cac gtg tac gtg aag gag ate tgc 12 69 

Lys Gly Pro Asp Ser Val Val Val His Val Tyr Val Lys Glu lie Cys 
295 300 305 

agg gac acc tea aga gta ctt ttc cgt gag cag gac ttc acg etc ate 1317 
Arg Asp Thr Ser Arg Val Leu Phe Arg Glu Gin Asp Phe Thr Leu He 
310 315 320 

ttc cag acc agg gat gga aac ttc ctg agg ctg cac ccg ggc tgt ggg 13 65 

Phe Gin Thr Arg Asp Gly Asn Phe Leu Arg Leu His Pro Gly Cys Gly 
325 330 335 

ccc cac acc acc ttc cgt tgg cag gtg aag etc agg aat ctg att gag 1413 
Pro His Thr Thr Phe Arg Trp Gin Val Lys Leu Arg Asn Leu He Glu 
340 345 350 355 

cca gag cag tgc acc ttc tgt ttc acg get tct cgc ate gac ate tgc 1461 
Pro Glu Gin Cys Thr Phe Cys Phe Thr Ala Ser Arg He Asp He Cys 
360 365 370 

ctt cgt aag agg cag agt cag cgc tgg ggg ggc ctg gag gee ccg get 1509 
Leu Arg Lys Arg Gin Ser Gin Arg Trp Gly Gly Leu Glu Ala Pro Ala 
375 380 385 

gca cga gtg ggt ggt gca aag gtt gee gtg ccg aca ggt cca acc cct 1557 
Ala Arg Val Gly Gly Ala Lys Val Ala Val Pro Thr Gly Pro Thr Pro 
390 395 400 

ctg gat tea acc cca cca gga ggt get ccc cac ccc ctg aca ggc cag 1605 
Leu Asp Ser Thr Pro Pro Gly Gly Ala Pro His Pro Leu Thr Gly Gin 
405 410 415 

gag gag gee egg get gtg gag aag gat aaa tec aag gca cga tct gag 1653 
Glu Glu Ala Arg Ala Val Glu Lys Asp Lys Ser Lys Ala Arg Ser Glu 
420 425 430 435 
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gac aca ggg eta gac agt gtg gca acc cgc aca ccc atg gag cat gta 1701 
Asp Thr Gly Leu Asp Ser Val Ala Thr Arg Thr Pro Met Glu His Val 
440 445 450 

acc cca aag cca gag aca cac ctg gec teg ccc aag cct aca tgc atg 1749 
Thr Pro Lys Pro Glu Thr His Leu Ala Ser Pro Lys Pro Thr Cys Met 
455 460 465 

gtg cct ccc atg ccc cac age cca gtt agt gga gac age gtg gag gag 17 97 
Val Pro Pro Met Pro His Ser Pro Val Ser Gly Asp Ser Val Glu Glu 
470 475 480 

gag gaa gag gaa gag aag aag gtg tgt ctg cca ggc ttc act ggc ctt 1845 
Glu Glu Glu Glu Glu Lys Lys Val Cys Leu Pro Gly Phe Thr Gly Leu 
485 490 495 

gtc aat tta ggc aac acc tgc ttc atg aac age gtc att cag tct ctg 1893 
Val Asn Leu Gly Asn Thr Cys Phe Met Asn Ser Val lie Gin Ser Leu 
500 505 510 515 

tec aac act egg gaa etc egg gac ttc ttc cat gac cgc tec ttt gag 1941 
Ser Asn Thr Arg Glu Leu Arg Asp Phe Phe His Asp Arg Ser Phe Glu 
520 525 530 

get gag ate aac tac aac aac cca eta ggg act ggt ggg cgt ctg gec 1989 
Ala Glu lie Asn Tyr Asn Asn Pro Leu Gly Thr Gly Gly Arg Leu Ala 
535 540 545 

att ggc ttt gec gtg ctg ctt egg gcg ctg tgg aag ggc acc cac cat 203 7 

lie Gly Phe Ala Val Leu Leu Arg Ala Leu Trp Lys Gly Thr His His 
550 555 560 

gec ttc cag cct tec aag ttg aag gee att gtg gcg agt aag gec age 2085 
Ala Phe Gin Pro Ser Lys Leu Lys Ala lie Val Ala Ser Lys Ala Ser 
565 570 575 

cag ttc aca ggc tat gca cag cat gat gee cag gag ttc atg get ttc 2133 
Gin Phe Thr Gly Tyr Ala Gin His Asp Ala Gin Glu Phe Met Ala Phe 
580 585 590 595 

ctg ctg gat ggg ctg cac gag gac ctg aat cgc att cag aac aag ccc 2181 
Leu Leu Asp Gly Leu His Glu Asp Leu Asn Arg lie Gin Asn Lys Pro 
600 605 610 

tac aca gag acc gtg gat tea gat ggg egg ccc gat gag gtg gta get 2229 
Tyr Thr Glu Thr Val Asp Ser Asp Gly Arg Pro Asp Glu Val Val Ala 
615 620 625 

gag gaa gca tgg cag egg cac aag atg agg aat gac tct ttc ate gtg 2277 
Glu Glu Ala Trp Gin Arg His Lys Met Arg Asn Asp Ser Phe lie Val 
630 635 640 

gac eta ttt cag ggg cag tac aag teg aag ctg gtg tgc cct gtg tgt 2325 
Asp Leu Phe Gin Gly Gin Tyr Lys Ser Lys Leu Val Cys Pro Val Cys 
645 650 655 

gec aag gtc tec ate act ttt gac ccg ttt ctt tat ctg ccg gtg ccc 2373 
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Ala Lys Val Ser lie Thr Phe Asp Pro Phe Leu Tyr Leu Pro Val Pro 
660 665 670 675 

ttg cca caa aag caa aag gtt etc cct gtc ttt tat ttt gec cga gag 2421 
Leu Pro Gin Lys Gin Lys Val Leu Pro Val Phe Tyr Phe Ala Arg Glu 
680 685 690 

ccc cac age aag ccc ate aag ttc ctg gtg age gtc age aag gag aac 2469 
Pro His Ser Lys Pro lie Lys Phe Leu Val Ser Val Ser Lys Glu Asn 
695 700 705 

tec act gcg age gaa gta ttg gac tec etc tct cag agt gtt cat gtg 2517 
Ser Thr Ala Ser Glu Val Leu Asp Ser Leu Ser Gin Ser Val His Val 
710 715 720 

aag cct gag aac ctg cgt ttg gcg gag gta att aag aat cgt ttt cat 2565 
Lys Pro Glu Asn Leu Arg Leu Ala Glu Val lie Lys Asn Arg Phe His 
725 730 735 

cgt gtg ttc cta ccc tcc cac tea ct g g ac act gtg tcc cca tct g at 2613 

Arg Val Phe Leu Pro Ser His Ser Leu Asp Thr Val Ser Pro Ser Asp 
740 745 750 755 

acg etc etc tgc ttt gag ctg eta tcc tca gag ttg get aag gag egg 2661 
Thr Leu Leu Cys Phe Glu Leu Leu Ser Ser Glu Leu Ala Lys Glu Arg 
760 765 770 

gta gtg gtg eta gag gtg caa cag cgc ccc cag gtg ccc age gtc ccc 2709 
Val Val Val Leu Glu Val Gin Gin Arg Pro Gin Val Pro Ser Val Pro 
775 780 785 

ate tcc aag tgt gca gee tgc cag egg aag caa cag teg gag gat gaa 2 7 57 
lie Ser Lys Cys Ala Ala Cys Gin Arg Lys Gin Gin Ser Glu Asp Glu 
790 795 800 

aag ctg aag cgc tgt ace egg tgc tac cgt gtg ggc tac tgc aac cag 2805 
Lys Leu Lys Arg Cys Thr Arg Cys Tyr Arg Val Gly Tyr Cys Asn Gin 
805 810 815 

etc tgc cag aaa acc cac tgg cct gac cac aag ggc etc tgc cga cct 2853 
Leu Cys Gin Lys Thr His Trp Pro Asp His Lys Gly Leu Cys Arg Pro 
820 825 830 835 

gag aac att ggc tac ccc ttc ctg gtc agt gta cct gee tca cgc etc 2901 
Glu Asn lie Gly Tyr Pro Phe Leu Val Ser Val Pro Ala Ser Arg Leu 
840 845 850 



act tat gec cgc etc get cag ttg eta gag ggc tat gec egg tac tct 
Thr Tyr Ala Arg Leu Ala Gin Leu Leu Glu Gly Tyr Ala Arg Tyr Ser 

855 860 865 



2949 



gtg agt gta ttc cag cca ccc ttt cag cca ggc cgc atg gee ttg gag 2 997 
Val Ser Val Phe Gin Pro Pro Phe Gin Pro Gly Arg Met Ala Leu Glu 
870 875 880 

tct cag age cct ggc tgc acc aca ctg etc tcc aca ggt tcc ctg gag 3 045 
Ser Gin Ser Pro Gly Cys Thr Thr Leu Leu Ser Thr Gly Ser Leu Glu 
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885 890 895 

get ggg gac age gag aga gac ccc att cag cca cct gag etc cag ctg 3093 

Ala Gly Asp Ser Glu Arg Asp Pro He Gin Pro Pro Glu Leu Gin Leu 

900 905 910 915 

gtg acc cct atg get gag ggg gac aca ggg ctt ccc egg gtg tgg gca 3141 

Val Thr Pro Met Ala Glu Gly Asp Thr Gly Leu Pro Arg Val Trp Ala 

920 925 930 



gec cct gac egg ggt cct gtg ccc age acc agt gga att tct tct gag 
Ala Pro Asp Arg Gly Pro Val Pro Ser Thr Ser Gly He Ser Ser Glu 
935 940 945 



3189 



atg ctg gee agt ggg ccc att gag gtt ggc tec ttg cca get ggc gag 3237 
Met Leu Ala Ser Gly Pro He Glu Val Gly Ser Leu Pro Ala Gly Glu 
950 955 960 

agg gtg tec cga ccc gaa get get gtg cct ggg tac cag cat cca agt 3285 
Arg Val Ser Arg Pro Glu Ala Ala Val Pro Gly Tyr Gin His Pro Ser 
965 970 975 

gaa get atg aat gee cac aca ccc cag ttc ttc ate tat aaa att gat 3333 
Glu Ala Met Asn Ala His Thr Pro Gin Phe Phe He Tyr Lys He Asp 
980 985 990 995 

tea tec aac cga gag cag egg eta gag gac aaa gga gac acc cca ctg 3 3 81 

Ser Ser Asn Arg Glu Gin Arg Leu Glu Asp Lys Gly Asp Thr Pro Leu 
1000 1005 1010 

gag ctg ggt gac gac tgt age ctg get etc gtc tgg egg aac aat gag 3429 
Glu Leu Gly Asp Asp Cys Ser Leu Ala Leu Val Trp Arg Asn Asn Glu 
1015 1020 1025 

cgc ttg cag gag ttt gtg ttg gta gee tec aag gag ctg gaa tgt get 3477 
Arg Leu Gin Glu Phe Val Leu Val Ala Ser Lys Glu Leu Glu Cys Ala 
1030 1035 1040 

gag gat cca ggc tct gee ggt gag get gee egg gee ggc cac ttc acc 3 52 5 

Glu Asp Pro Gly Ser Ala Gly Glu Ala Ala Arg Ala Gly His Phe Thr 
1045 1050 1055 

ctg gac cag tgc etc aac etc ttc aca egg cct gag gtg ctg gca ccc 
Leu Asp Gin Cys Leu Asn Leu Phe Thr Arg Pro Glu Val Leu Ala Pro 
1060 1065 1070 1075 

gag gag gee tgg tac tgc cca cag tgc aaa cag cac cgt gag gec tec 
Glu Glu Ala Trp Tyr Cys Pro Gin Cys Lys Gin His Arg Glu Ala Ser 
1080 1085 1090 

aag cag ctg ttg eta tgg cgc ctg cca aat gtt etc ate gtg cag etc 
Lys Gin Leu Leu Leu Trp Arg Leu Pro Asn Val Leu He Val Gin Leu 
1095 HOO 1105 

aag cgc ttc tec ttt cgt agt ttt ate tgg cgt gac aag ate aat gac 3717 
Lys Arg Phe Ser Phe Arg Ser Phe He Trp Arg Asp Lys He Asn Asp 
1110 1115 1120 



3573 



3621 



3669 



8 



ttg gtg gag ttc cct gtt agg aac ctg gac ctg age aag ttc tgc att 3765 
Leu Val Glu Phe Pro Val Arg Asn Leu Asp Leu Ser Lys Phe Cys He 
1125 H30 1135 

ggt cag aaa gag gag cag ctg ccc age tac gat eta tat get gtc ate 3 813 
Gly Gin Lys Glu Glu Gin Leu Pro Ser Tyr Asp Leu Tyr Ala Val He 
1140 H45 H50 H55 . 

aac cac tat gga ggc atg att ggt ggc cac tac act gee tgt gca cgc 3 861 
Asn His Tyr Gly Gly Met He Gly Gly His Tyr Thr Ala Cys Ala Arg 
1160 1165 H70 



ctg ccc aat gat cgt age agt cag cgc agt gac gtg ggc tgg cgc ttg 
Leu Pro Asn Asp Arg Ser Ser Gin Arg Ser Asp Val Gly Trp Arg Leu 
1175 1180 1185 



cgc tgg aga tga get cgcctgcagg cagctgetgt gagctggect acctgcctgc 
Arg Trp Arg * 



3909 



ttt gat gac age aca gtg aca acg gta gac gag age cag gtt gtg acg 3957 
Phe Asp Asp Ser Thr Val Thr Thr Val Asp Glu Ser Gin Val Val Thr 
1190 1195 1200 



4005 



cgt tat gec tat gta etc ttc tac cgc egg egg aac tct cct gtg gag 
Arg Tyr Ala Tyr Val Leu Phe Tyr Arg Arg Arg Asn Ser Pro Val Glu 
1205 1210 1215 

agg ccc ccc agg gca ggt cac tct gag cac cac cca gac eta ggc cct 4053 
Arg Pro Pro Arg Ala Gly His Ser Glu His His Pro Asp Leu Gly Pro 
1220 1225 1230 1235 

gca get gag get get gee age cag get tec egg att tgg cag gag ctg 4101 
Ala Ala Glu Ala Ala Ala Ser Gin Ala Ser Arg He Trp Gin Glu Leu 
1240 1245 1250 

gag get gag gag gag ccg gtg cct gag ggg tct ggg ccc ctg ggt ccc 414 9 

Glu Ala Glu Glu Glu Pro Val Pro Glu Gly Ser Gly Pro Leu Gly Pro 
1255 1260 1265 

tgg ggg ccc caa gac tgg gtg ggc ccc eta cca cgt ggc cct ace aca 4197 
Trp Gly Pro Gin Asp Trp Val Gly Pro Leu Pro Arg Gly Pro Thr Thr 
1270 1275 1280 

cca gat gag ggc tgc etc egg tac ttt gtc ctg ggc ace gtg gcg get 4245 
Pro Asp Glu Gly Cys Leu Arg Tyr Phe Val Leu Gly Thr Val Ala Ala 
1285 1290 1295 

ttg gtg gee etc gtg etc aac gtg ttc tat cct ctg gta tec cag agt 4293 
Leu Val Ala Leu Val Leu Asn Val Phe Tyr Pro Leu Val Ser Gin Ser 
1300 1305 1310 1315 



4348 



cccaggccat gcctgccttt gttgtgggga acacctctgg getttgggee tcagcttatg 4408 
catctggtgg gagagggtgg ggaggttgtg gcccctgcag gggcagagta tcctagggtg 44 68 



9 



tgtatccatc tggctgtctg tccattcatc ctgctgctct gacccttggc ctcaggcttg 4528 
gccctgccca agctacttcc tgtacttaaa agtgttaata aaaccagact attcaggccc 4588 
aaaaaaaaaa aa 4600 



<210> 3 

<211> 2663 

<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 

<222> (115) . . (2157) 
<400> 3 

cggaattccc gggtcgacga tttcgtcgcg gccggctgag tcctctccag ccgcgagagg 60 

cgtttctcca tccgcggctc gcgcgctcgc tctgagcccc cgcgcccagg tggg atg 117 

Met 

1 

gaa gaa gcc tgt cag gtg cag aca act aag cga ggg gat cct cat gag 165 
Glu Glu Ala Cys Gin Val Gin Thr Thr Lys Arg Gly Asp Pro His Glu 
5 10 15 

tta aga aac ata ttt eta cag tat gcc agt act gag gtt gat gga gag 213 
Leu Arg Asn lie Phe Leu Gin Tyr Ala Ser Thr Glu Val Asp Gly Glu 
20 25 30 

cgt tac atg acc cca gaa gac ttt gtt cag cgc tat ctt gga ctg tat 2 61 

Arg Tyr Met Thr Pro Glu Asp Phe Val Gin Arg Tyr Leu Gly Leu Tyr 
35 40 45 

aat gat cca aat agt aac cca aag ate gtg cag etc ttg gca gga gta 30 9 

Asn Asp Pro Asn Ser Asn Pro Lys lie Val Gin Leu Leu Ala Gly Val 
50 55 60 65 

get gat caa acc aag gat ggg ttg ate tec tat caa gag ttt ttg gca 357 
Ala Asp Gin Thr Lys Asp Gly Leu lie Ser Tyr Gin Glu Phe Leu Ala 
70 75 80 

ttt gaa tct gtt tta tgt get cca gat tec atg ttc ata gtg get ttc 405 
Phe Glu Ser Val Leu Cys Ala Pro Asp Ser Met Phe lie Val Ala Phe 
85 90 95 

cag ttg ttt gac aag agt gga aat gga gag gtg aca ttt gaa aat gtc 45 3 

Gin Leu Phe Asp Lys Ser Gly Asn Gly Glu Val Thr Phe Glu Asn Val 
100 105 110 

aaa gaa att ttt gga cag act att att cat cat cat ate cct ttt aac 501 
Lys Glu He Phe Gly Gin Thr He He His His His He Pro Phe Asn 
115 120 125 
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tgg gat tgt 
Trp Asp Cys 
130 

cat ctt aac 
His Leu Asn 



gaa cat gca 
Glu His Ala 



atg att tct 
Met lie Ser 
180 

cac atg ctt 
His Met Leu 
195 

gga agt ate 
Gly Ser lie 
210 

teg tta ctg 
Ser Leu Leu 



get ggc aca 
Ala Gly Thr 



agt gec ata 
Ser Ala lie 
260 

tat cag ctt 
Tyr Gin Leu 
275 

gat att gag 
Asp lie Glu 
290 

ctg gca gaa 
Leu Ala Glu 



tgg etc cag 
Trp Leu Gin 



get gga get 
Ala Gly Ala 
340 

acc cga atg 



gaa ttt ate 
Glu Phe He 
135 

tac aca gaa 
Tyr Thr Glu 
150 

aga caa gec 
Arg Gin Ala 
165 

ggt ctg gat 
Gly Leu Asp 



act cct ttt 
Thr Pro Phe 



tea cac cag 
Ser His Gin 
215 

aat aac atg 
Asn Asn Met 
230 

agg aaa gat 
Arg Lys Asp 
245 

cgc tat gga 
Arg Tyr Gly 



gca gac tta 
Ala Asp Leu 



aga ata gec 
Arg He Ala 
295 

ctt cag aga 
Leu Gin Arg 
310 

att gec gag 
He Ala Glu 
325 

gtg gga gec 
Val Gly Ala 



caa aac cag 



cga ctg cat 
Arg Leu His 



ttc acg cag 
Phe Thr Gin 



ttt gca etc 
Phe Ala Leu 
170 

ttc agt gac 
Phe Ser Asp 
185 

gtg gag gag 
Val Glu Glu 
200 

gtt age ttc 
Val Ser Phe 



gag ctt gtt 
Glu Leu Val 



gtt gaa gtc 
Val Glu Val 
250 

caa gtc aca 
Gin Val Thr 
265 

tat aat get 
Tyr Asn Ala 
280 

cca ttg get 
Pro Leu Ala 



cag cag tct 
Gin Gin Ser 



tct get tac 
Ser Ala Tyr 
330 

act gca gtg 
Thr Ala Val 
345 

cgt ggc tct 



ttt ggg cat 
Phe Gly His 
140 

ttt etc cag 
Phe Leu Gin 
155 

aaa gac aaa 
Lys Asp Lys 



ate atg gtt 
He Met Val 



aac tta gtt 
Asn Leu Val 
205 

tec tac ttc 
Ser Tyr Phe 
220 

cgt aag ata 
Arg Lys He 
235 

aca aag gag 
Thr Lys Glu 



cca eta gaa 
Pro Leu Glu 



tea ggg cgc 
Ser Gly Arg 
285 

gag ggg gee 
Glu Gly Ala 
300 

cct ggg tta 
Pro Gly Leu 
315 

aga ttc act 
Arg Phe Thr 



tat cct ata 
Tyr Pro He 



ggc tct gtt 



aac egg aag 
Asn Arg Lys 



gag ctg caa 
Glu Leu Gin 
160 

age aaa agt 
Ser Lys Ser 
175 

acc att aga 
Thr He Arg 
190 

tea gca get 
Ser Ala Ala 



aat gca ttt 
Asn Ala Phe 



tat age act 
Tyr Ser Thr 
240 

gaa ttt gec 
Glu Phe Ala 
255 

att gat att 
He Asp He 
270 

ttg act ttg 
Leu Thr Leu 



tta cct tac 
Leu Pro Tyr 



ggc agg cct 
Gly Arg Pro 
320 

ctg ggc tea 
Leu Gly Ser 
335 

gat ctg gtg 
Asp Leu Val 
350 

gtt ggg gag 



aag 54 9 

Lys 

145 

ttg 597 
Leu 



ggc 645 
Gly 



tct 693 
Ser 



gga 741 
Gly 



aac 789 

Asn 

225 

eta 837 
Leu 



cag 885 
Gin 



eta 933 
Leu 



gca 981 
Ala 



aac 1029 

Asn 

305 

ate 1077 
He 



gtt 1125 
Val 



aag 1173 
Lys 



Cta 1221 



11 



1413 



1461 



Thr Arg Met Gin Asn Gin Arg Gly Ser Gly Ser Val Val Gly Glu Leu 
355 360 365 

atg tac aaa aac age ttt gac tgt ttt aag aaa gtc ttg cgt tat gag 1269 
Met Tyr Lys Asn Ser Phe Asp Cys Phe Lys Lys Val Leu Arg Tyr Glu 
370 375 380 385 

ggc ttc ttt gga etc tac agg ggt ctg ata cca caa ctt ata ggg gtt 1317 
Gly Phe Phe Gly Leu Tyr Arg Gly Leu He Pro Gin Leu He Gly Val 
390 395 400 

get cca gaa aag gec att aaa ctg act gtt aat gat ttt gtt egg gac 1365 
Ala Pro Glu Lys Ala He Lys Leu Thr Val Asn Asp Phe Val Arg Asp 
405 410 415 

aaa ttt acc aga aga gat ggc tct gtt cca ctt cca gca gaa gtt ctt 
Lys Phe Thr Arg Arg Asp Gly Ser Val Pro Leu Pro Ala Glu Val Leu 
420 425 430 

get gga ggc tgt get gga ggc tct cag gtc att ttt acc aac cca ttg 
Ala Gly Gly Cys Ala Gly Gly Ser Gin Val He Phe Thr Asn Pro Leu 
435 440 445 

gag ata gtg aag att cgt ctg caa gta get gga gag ate acc acg gga 1509 
Glu He Val Lys He Arg Leu Gin Val Ala Gly Glu He Thr Thr Gly 
450 455 460 465 

ccc aga gtc age gee ctg aat gtg etc egg gac ttg gga att ttt ggt 
Pro Arg Val Ser Ala Leu Asn Val Leu Arg Asp Leu Gly He Phe Gly 
470 475 480 

ctg tat aag ggt gee aaa gcg tgt ttc etc cga gac att ccc ttc tct 
Leu Tyr Lys Gly Ala Lys Ala Cys Phe Leu Arg Asp He Pro Phe Ser 
485 490 495 

gca ate tat ttt cct gtt tat get cat tgc aaa eta ctt ctg get gat 
Ala He Tyr Phe Pro Val Tyr Ala His Cys Lys Leu Leu Leu Ala Asp 
500 505 510 

gaa aat gga cac gtg gga ggt tta aat ctt ctt gca get gga gee atg 
Glu Asn Gly His Val Gly Gly Leu Asn Leu Leu Ala Ala Gly Ala Met 
515 520 525 

gca ggt gtc cca get gca tct ctg gtg acc cct get gat gtc ate aag 
Ala Gly Val Pro Ala Ala Ser Leu Val Thr Pro Ala Asp Val He Lys 
530 535 540 545 

aca aga ctg cag gtg get gee cgc get ggc cag acg aca tac agt ggt 
Thr Arg Leu Gin Val Ala Ala Arg Ala Gly Gin Thr Thr Tyr Ser Gly 
550 555 560 

gtc ate gac tgt ttc agg aag att etc egg gaa gaa ggg ccc tea gca 
Val He Asp Cys Phe Arg Lys He Leu Arg Glu Glu Gly Pro Ser Ala 
565 570 575 

ttt tgg aaa ggg act gca get cga gtg ttt cga tec tct ccc cag ttt 
Phe Trp Lys Gly Thr Ala Ala Arg Val Phe Arg Ser Ser Pro Gin Phe 



1557 



1605 



1653 



1701 



1749 



1797 



1845 



1893 



12 



580 585 590 

ggt gtt acc ttg gtc act tat gaa ctt etc cag egg tgg ttt tac att 1941 
Gly Val Thr Leu Val Thr Tyr Glu Leu Leu Gin Arg Trp Phe Tyr He 
595 600 605 



gat ttt gga ggc etc aaa ccc get ggt tea gaa cca aca cct aag tea 
Asp Phe Gly Gly Leu Lys Pro Ala Gly Ser Glu Pro Thr Pro Lys Ser 
610 615 620 625 

cgc att gca gac ctt cct cct gec aac cct gat cac ate ggt gga tac 
Arg He Ala Asp Leu Pro Pro Ala Asn Pro Asp His He Gly Gly Tyr 
630 635 640 

aga etc gee aca gee acg ttt gca ggc ate gaa aac aaa ttt ggc ctt 
Arg Leu Ala Thr Ala Thr Phe Ala Gly He Glu Asn Lys Phe Gly Leu 
645 650 655 



1989 



2037 



2085 



tat etc ccg aaa ttt aag tct cct agt gtt get gtg gtt cag cca aag 2133 
Tyr Leu Pro Lys Phe Lys Ser Pro Ser Val Ala Val Val Gin Pro Lys 
660 665 670 

gca gca gtg gca gee act cag tga tgagacaact gttgagtgtg gcaaaatggc 2187 
Ala Ala Val Ala Ala Thr Gin * 
675 680 

gecttgaaga aagagectag gagagcagee ctgtaatgta tccagtcagc tgcatggtac 2247 
tgactgagct gaggagtcaa actcttcttt ctgtatgaca tatacatata cttgtttata 2307 
aaataatcat ttgcccaggg aaaaaaccac aacgctgttt caagctttag tcttatgtgt 2367 
tgaaatgttt ttgtaagcct tggcatgaat tagtgttcta gaetctgett tgeacagett 2427 
gcacttacag tgattgtaca tattgtacat ctttgtacag agacatcttg gcacctcatc 
ccaacaaatc acatttgtag aaatgtaatg eggttctgag tggcttgaaa tgtacagaat 
gttttgaaag tgttttatta agaatcacac aaaaataaat gtattaaaat taaattcatt 2607 
ctcttattgg tgacttatgg aaataaagca tcaatattgg atgtaaaaaa aaaaaa 



2487 
2547 



2663 
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<223> n = a # t,c or g 



<400> 4 

cgcgaggcaa canatgac atg ttg gcc ttc ctg tct ggg atg ccg gtg acc 51 

Met Leu Ala Phe Leu Ser Gly Met Pro Val Thr 
15 10 

aga aac acc aag tac etc gac etc aag aat tea caa gag atg etc cgc 99 
Arg Asn Thr Lys Tyr Leu Asp Leu Lys Asn Ser Gin Glu Met Leu Arg 
15 20 25 

tac aaa gag gtc tgc tac tac atg etc ttt gcc ctg get gcc tac ggg 147 
Tyr Lys Glu Val Cys Tyr Tyr Met Leu Phe Ala Leu Ala Ala Tyr Gly 
30 35 40 

tgg ccc atg tac ctg atg egg aag ccc gcc tgc ggc etc tgc caa ctg 195 
Trp Pro Met Tyr Leu Met Arg Lys Pro Ala Cys Gly Leu Cys Gin Leu 
45 50 55 

get egg tec tgc teg tgt tgc ctg tgt cct gcg agg ccg egg ttc gcc 243 
Ala Arg Ser Cys Ser Cys Cys Leu Cys Pro Ala Arg Pro Arg Phe Ala 
60 65 70 75 

cct gga gtc acc ate gag gaa gac aac tgc tgt ggc tgt aat gcc att 2 91 

Pro Gly Val Thr He Glu Glu Asp Asn Cys Cys Gly Cys Asn Ala He 
80 85 90 

gcc ate egg cgc cac ttc ctg gac gag aac atg act gcg gtg gac ate 33 9 

Ala He Arg Arg His Phe Leu Asp Glu Asn Met Thr Ala Val Asp He 
95 100 105 



gtc tat acc tec tgc cat gat gcg gtc tat gaa acg ccc ttc tac gtg 
Val Tyr Thr Ser Cys His Asp Ala Val Tyr Glu Thr Pro Phe Tyr Val 

115 120 



110 



387 



gcg gtg gac cat gac aag aag aaa gtg gtg ate agt ate egg ggg acc 435 

Ala Val Asp His Asp Lys Lys Lys Val Val He Ser He Arg Gly Thr 
125 130 135 

ctg tec ccc aag gat gcc ctg act gac ctg acg ggt gat get gag cgc 

Leu Ser Pro Lys Asp Ala Leu Thr Asp Leu Thr Gly Asp Ala Glu Arg 

140 145 150 155 

etc ccc gtg gag ggg cac cac ggc acc tgg ctg ggc cac aag ggt atg 

Leu Pro Val Glu Gly His His Gly Thr Trp Leu Gly His Lys Gly Met 
160 165 170 



483 



531 



gtc etc tea get gag tac ate aag aag aaa ctg gag cag gag atg gtc 57 9 

Val Leu Ser Ala Glu Tyr He Lys Lys Lys Leu Glu Gin Glu Met Val 
175 180 185 



627 



ctg tec cag gcc ttt ggg cga gac ctg ggc cgc gga acc aaa cac tac 
Leu Ser Gin Ala Phe Gly Arg Asp Leu Gly Arg Gly Thr Lys His Tyr 
190 195 200 

ggc ctg att gtg gtg ggc cac tec ctg ggc gcg ggc act get gcc ate 675 
Gly Leu He Val Val Gly His Ser Leu Gly Ala Gly Thr Ala Ala He 



14 



205 210 215 

etc tec ttc ctt ctg cgc cca cag tat ccg acc etc aag tgc ttt gec 723 
Leu Ser Phe Leu Leu Arg Pro Gin Tyr Pro Thr Leu Lys Cys Phe Ala 
220 225 230 235 

tac tec ccg cca ggg ggc ctg ctg agt gag gat gcg atg gag tat tec 771 
Tyr Ser Pro Pro Gly Gly Leu Leu Ser Glu Asp Ala Met Glu Tyr Ser 
240 245 250 

aag gag ttc gtg act get gtg gtt ctg ggc aaa gac etc gtc ccc agg 819 
Lys Glu Phe Val Thr Ala Val Val Leu Gly Lys Asp Leu Val Pro Arg 
255 260 265 

att ggc etc tct cag ctg gaa ggc ttc cgc aga cag etc ctg gat gtc 
lie Gly Leu Ser Gin Leu Glu Gly Phe Arg Arg Gin Leu Leu Asp Val 
270 275 280 

ctg cag cga age acc aag ccc aaa tgg egg ate ate gtg ggg gec acc 
Leu Gin Arg Ser Thr Lys Pro Lys Trp Arg He He Val Gly Ala Thr 
285 290 295 

aaa tgc ate ccc aag teg gag ctg cct gag gag gta gag gtg acc acc 
Lys Cys He Pro Lys Ser Glu Leu Pro Glu Glu Val Glu Val Thr Thr 
300 305 310 315 

ctg gee age acg egg etc tgg acc cac ccc age gac eta act ata gec 
Leu Ala Ser Thr Arg Leu Trp Thr His Pro Ser Asp Leu Thr He Ala 
320 325 330 

etc tea gec age act cca etc tac ccg ccc ggc cgc ate ate cac gtg 
Leu Ser Ala Ser Thr Pro Leu Tyr Pro Pro Gly Arg He He His Val 
335 340 345 

gtc cac aac cac cct gca gag cag tgc tgc tgc tgt gag cag gag gag 
Val His Asn His Pro Ala Glu Gin Cys Cys Cys Cys Glu Gin Glu Glu 
350 355 360 

ccc aca tac ttt gec ate tgg ggc gac aac aag gec ttc aat gag gtg 1155 
Pro Thr Tyr Phe Ala He Trp Gly Asp Asn Lys Ala Phe Asn Glu Val 
365 370 375 

ate ate teg cca gec atg ctg cat gag cac ctg ccc tat gtg gtc atg 1203 
He He Ser Pro Ala Met Leu His Glu His Leu Pro Tyr Val Val Met 
380 385 390 395 

gag ggg etc aac aag gtg ctg gag aac tac aac aag ggg aag acc get 1251 
Glu Gly Leu Asn Lys Val Leu Glu Asn Tyr Asn Lys Gly Lys Thr Ala 
400 405 410 

ctg etc tct gca gee aag gtc atg gtg age cct acc gag gtg gac ctg 12 99 

Leu Leu Ser Ala Ala Lys Val Met Val Ser Pro Thr Glu Val Asp Leu 
415 420 425 

act cct gag etc ate ttc cag cag cag cca etc ccc acg ggg ccg ccc 1347 
Thr Pro Glu Leu He Phe Gin Gin Gin Pro Leu Pro Thr Gly Pro Pro 
430 435 440 



867 



915 



963 



1011 



1059 



1107 
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atg ccc act ggc ctt gcc ctg gag ctg ccg act gca gac cac cgc aac 13 95 

Met Pro Thr Gly Leu Ala Leu Glu Leu Pro Thr Ala Asp His Arg Asn 
445 450 455 



ttc gcc gag ttc ate gac age etc ttc aac ctg gac age aag age age 
Phe Ala Glu Phe He Asp Ser Leu Phe Asn Leu Asp Ser Lys Ser Ser 
655 660 665 



1443 



1491 



1539 



age age gtc agg age aag tec cag tct gag atg age ctg gag ggc ttc 
Ser Ser Val Arg Ser Lys Ser Gin Ser Glu Met Ser Leu Glu Gly Phe 
460 465 470 475 

teg gag ggg egg ctg ctg teg cca gtg gtt gcg gcg gcg gcc cgc cag 
Ser Glu Gly Arg Leu Leu Ser Pro Val Val Ala Ala Ala Ala Arg Gin 
480 485 490 

gac ccg gtg gag ctg ctg ctg ctg tct ace cag gag egg ctg gca gcg 
Asp Pro Val Glu Leu Leu Leu Leu Ser Thr Gin Glu Arg Leu Ala Ala 
495 500 505 

gag ctg cag gcc egg egg gca cca ctg gcc ace atg gag age etc teg 1587 
Glu Leu Gin Ala Arg Arg Ala Pro Leu Ala Thr Met Glu Ser Leu Ser 
510 515 520 

gac act gag tec ctg tac age ttc gac teg cgc cgc tec tea ggc ttc 1635 
Asp Thr Glu Ser Leu Tyr Ser Phe Asp Ser Arg Arg Ser Ser Gly Phe 
525 530 535 

cgc age ate egg ggc tec ccc age etc cac get gtg ctg gag cgt gat 1683 
Arg Ser He Arg Gly Ser Pro Ser Leu His Ala Val Leu Glu Arg Asp 
540 545 550 555 

gaa ggc cac etc ttc tac att gac cct gcc ate ccc gag gaa aac cca 1731 
Glu Gly His Leu Phe Tyr He Asp Pro Ala He Pro Glu Glu Asn Pro 
560 565 570 

tec ctg age teg cgc act gag ctg ctg gcg gcc gac age ctg tec aag 1779 
Ser Leu Ser Ser Arg Thr Glu Leu Leu Ala Ala Asp Ser Leu Ser Lys 
575 580 585 

cac tea cag gac acg cag ccc ctg gag gcg gcc ctg ggc agt ggc ggc 1827 
His Ser Gin Asp Thr Gin Pro Leu Glu Ala Ala Leu Gly Ser Gly Gly 
590 595 600 

gtc act cct gag egg ccc ccc agt get gcg gcc aat gac gag gag gaa 1875 
Val Thr Pro Glu Arg Pro Pro Ser Ala Ala Ala Asn Asp Glu Glu Glu 
605 610 615 

gag gtt ggc ggt ggg ggt ggc ggg ccg gcc tec cgc ggg gag ctg gcg 192 3 

Glu Val Gly Gly Gly Gly Gly Gly Pro Ala Ser Arg Gly Glu Leu Ala 
620 625 630 635 

ctg cac aat ggg cgc ctg ggg gac teg ccc agt cct cag gtg ctg gaa 1971 
Leu His Asn Gly Arg Leu Gly Asp Ser Pro Ser Pro Gin Val Leu Glu 
640 645 650 



2019 
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tec ttc caa gac etc tac tgc atg gtg gtg ccc gag age ccc acc agt 2067 
Ser Phe Gin Asp Leu Tyr Cys Met Val Val Pro Glu Ser Pro Thr Ser 
670 675 680 

gac tac get gag ggc ccc aag tec ccc age cag caa gag ate ctg etc 2115 
Asp Tyr Ala Glu Gly Pro Lys Ser Pro Ser Gin Gin Glu lie Leu Leu 
685 690 695 

cgt gec cag ttc gag ccc aac ctg gtg ccc aag ccc cca egg etc ttt 2163 
Arg Ala Gin Phe Glu Pro Asn Leu Val Pro Lys Pro Pro Arg Leu Phe 
700 705 710 715 

gec ggc tea gec gac ccc tec teg ggc ate tea etc teg ccc tec ttc 2211 
Ala Gly Ser Ala Asp Pro Ser Ser Gly lie Ser Leu Ser Pro Ser Phe 
720 725 730 

ccg etc age tec teg ggt gag etc atg gac ctg acg ccc acg ggc etc 2259 
Pro Leu Ser Ser Ser Gly Glu Leu Met Asp Leu Thr Pro Thr Gly Leu 
735 740 745 

agt age cag gaa tgc ctg gcg get gac aag ate egg act tct acc ccc 23 07 

Ser Ser Gin Glu Cys Leu Ala Ala Asp Lys He Arg Thr Ser Thr Pro 
750 755 760 

act ggc cac gga gec age ccc gec aag caa gat gag ctg gtc ate tea 2355 
Thr Gly His Gly Ala Ser Pro Ala Lys Gin Asp Glu Leu Val He Ser 
765 770 775 

gca cgc tag cacccca gttgcgtggc cagccgggcc caggcaggag caggtggccc 2411 

Ala Arg * 

780 

tgtgggcacc tggtgcctgc cccctgccgg gcagctttaa ggacagaccc ccaggggcag 2471 

tttagectea ggcacaggca tegctgetga gctgggggtc cgcatcccta cctcagctta 2531 

ggacccccag agecaaggtg gctgggatct ggccccacag atggggaaag atggggaagg 2591 

gtgtggagtg gggaggagee tgggcagect gctgggtggg ccacactcag cctgactgcc 2651 

ctccatgggg gcattctggc accccctgct ccaggacagg ccatgggcaa gctgcctccc 2711 

atcactgcct getggctget ctcccagggg ccaggtggag agcagtgccc cccgacacat 2771 

gtattctcat ctgtggtcca ggccggcatc gtcctggcca ccccccagat ctggtgcctg 2831 

ctggccggcc ccctggggtg cccctgccga ggtggcctgc agtgctgtac atgtttacag 2891 

aagctgctgg gcttggctca ggatgtgttc tgggcttgea agccccccgc ccaatcatgt 2951 

gttcagtagc catcctctga gcagggccca aggcagecag gggcctggag gggecagagg 3 011 

a gggtggggt cagggccgcc ccttctctgc cttgtgcctc tcatgctgcc tcctctgccc 3071 

atgggtcctg ggcacccagg cctgccctgc ctgctggcta cttcctggct taccttctac 3131 

ccccaaggat cctcaccacc caaagggtgg tgggcactgc tgtgaccacc ccagctgcag 3191 
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agtcagtgcc 


ctgggtggaa 


ggaaggcact 


gagagccccc ttcctctgag ggccccacct 


3251 


caccccttgg 


tgtcaccccc 


accacgccta ggcagctctg ggccctggga tctggaacca 


3311 


acacacccct gttcccctca gctttccctc ctcgctggcc tgggcaccct cctgggagca 


3371 


ggccttcctc 


cctcccaccc 


ccaatgtcct 


gttggtagga ggtggggcca agagtggggt 


3431 


atggtgggcc 


ttggctggag 


acctctgtcc 


actgcccagg gaggggcctg gggctgggag 


3491 


cagtcccggt 


ttagcctgag gtccccatag ggcttcctcc cctgctgggt ttgggaagca 


3551 


gttagggaga tagcgacccg gagtttcccc agaagcgggg tgggagggtg tgcatgctag 


3611 


tgttggcgcg 


tatgcatgtg 


catgagtgtg 


caccgttcct aaggaagggg cctctggggc 


3671 


tgcccaccct 


acctgccctg 


cctgcctgct gcccctccca gcctgccaag aaaacggtag 


3731 


gggagcatga 


tggggccttt 


gaggcagggt 


cgcagggaca agctcagctt taggcaccat 


3791 


ctgttcccat 


cgcgcctgct 


gctgtgaccc 


gttttggaaa actggtgtgt accgaggcgc 


3851 


tgactgcacg 


gctgaccgcc 


tgctcgtgcc 


ttcattctgc agcggcatgg tccctcccat 


3911 


tctggctcca 


cctgcagcct 


ccctgggtgg 


cctaggctcc cccgaccaag agacctccct 


3971 


ctcatgatca 


ctggtacctg 


ggggcctgaa 


ttctggcccc cggctcccca cacagctggg 


4031 


actggcctgg 


atggctgtcc 


tgggagcccc 


tgcccaccct gacagaggga gctgggcctc 


4091 


ccctcatcct 


ctgtaactcc 


cgccttcacc 


agactcaagg acaccctggc cctgctgagg 


4151 


catacagagc 


ttcagcccag 


cacagaagca 


agacaaaatc agtggctctt agagtttaga 


4211 


aaacaagaca 


gactctcaga 


tgaaagatct 


gacaagcacc gtggccagtc acagggagag 


4271 


acttgatgtc 


tggcctttta 


attcctcctc 


tgccagggtg ggtcctggga cctctaatgt 


4331 


gggcatgtcg tccaccccag gacaagccat cagggacaga ccccccaccc ccaaggctgc 


4391 


agccacacca 


tgtttcaggc 


ttggggctgg ggcaggcttg ggctcaatcc tgggcaccca 


4451 


ggggcagccc 


acccctaacc 


tggctcctac 


ccaccttgcc cttgaaggat gggcctgctg 


4511 


cacgtctccc 


tcctccaccc 


cataccacac 


tggggggtct gagccacccc cctcagcccc 


4571 


gttcggctca gaccgacccc 


cactccatcc 


ccagacctgc agcacaagtg cgcgggcctg 


4631 


tcctcccagg 


ggcctgggcg 


actccatatg 


caatcagtag cgagcagccg ggccccacag 


4691 


accctcatgc 


actctcttac 


gtgccattct 


ccccagactt tttttgtact taatgtatga 


4751 


aagatccaaa 


ctaatattgc 


tgtaaaaagg 


agagacaaat taatatagct tattctataa 


4811 


atatatctgt 


atataaaggt 


ttctgtatat 


tgtatagagc tgtgtataaa ctggatgtag 


4871 
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aagcacgctg gctgcctcga atgtctttgc atcaggtggg gactgggtag 


aaatttgatg 


4931 


tcgaggttgc agcagagcag ggggttggca tggggccggg ggccgggggg 


gcctctgcca 


4991 


ctgttccctg gatggaacag aaagcctgct cctgctcggc tagtgccctg 


gcccggggcc 


5051 


acactttgag ttgcagggag ggagtagaag acccttgaag ccccctgtga 


atgagggcag 


5111 


ccacttgcgg agtcctgccc cactttgagc cctcctcttc ctctgcgaaa 


tggcctgatg 


5171 


ccagtgctgt gtgggtccac aggaggccag gagagtccca cggtggggag gacagggctg 


5231 


taccttctcc tgggctggcc tctacgccca ttacccatta accctcaggt 


gccagcatcc 


5291 


ctctccccag tgctgccttc ggtctaccac ctcctcctgg ccctgctccc 


actcaaggga 


5351 


cagtgatggg tgctgagagc tgattggaac tggagagggc accatttact 


gatcactgac 


5411 


ctggcacttt acctccactg taaggcaggg atactgagtg tgctttatag 


atgaggcctc 


5471 


tgaggacaga gggcaggcct tggggactag gtggagctgg ctacagggga 


cagccatgtc 


5531 


tgctgggcta gggctgaaat ctcagcccct cactcactgt ggctcggtag 


agaagccagg 


5591 


ggcacagatg aggactcatc tccattgatg ggccccccta ggtccttgta tgcaagtccc 


5651 


ctgggctact ttcaccggcc cagccacctt cctgccccag cctctgcccc 


agcgctgctt 


5711 


gggacccaac ttcattatgg agttggcaga tggcagcctc aactcttggc 


tgagcccttg 


5771 


agtctgggac atttcagcca cctctttcct ccagtccaga gatgaaaatc 


cctggggaca 


5831 


gttgctcctt tgctcagtga cctagtgtaa caggggagat ggcagggcct 


gagctccctt 


5891 


agccaggtca gctgctacag ggttaacagg aggctccatt ccaccccttc 


caacttcaag 


5951 


gctaccctag agattgaata atctatactc ttaattgatt ataatgcaat 


ggagttgggg 


6011 


cgttagggac aaagtacgag tcttcccttc tacctccagc ccttgctgac 


caggacaggg 


6071 


acaatgtgta gctcaacgga tggtggtgag aacttagatg atggtcagtt 


atgcagtatg 


6131 


tgggatacgg aggaaagatc cgtgggtatg tggaggcttg tagagaagct 


ggttctgtgg 


6191 


ctggtcccag gcgactcgta atgtaaatcc gtttctcaga atcgcgtggt 


gtaggcgggt 


6251 


gtctactttg tccgcaggca ggcctgaccc cgggtggagg aggggcaggg 


tggagaataa 


6311 


caattgtctt aagggagtct gcaagacagg agggggtggc agagaagaac 


ctcagctctg 


6371 


aagaagctca ctgcccagcc cttcccacct tcctcttcac ggacctagca 


ccttcctggg 


6431 


cctcagtttc tctcattgcc ttgggtgctg gagtatgtgg ggcctcctct 


cctatctcca 


6491 


ggccttcagc cccggctgcc acggggtgtg ggtacctctt ggttgggtct 


cggggtagga 


6551 


tgatgtaatg gttctgtgca ttcgccagcg agggcagctg gggtctgttc 


ctagctctcc 


6611 
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tgcttaccca cagtgcttct cttggccgta tcagggcacc gctgtgcctc cgctttctca 
tcttgaagat tactggtccc cagggtaggt cagtgcccct aagcttaggg ggcttgttga 
gcatgttctg tggttctgtg tgcaaggcct gaaccatgac agctctggcc cagcgtggcc 
tggtcctggt ccctggcaca tccagtgggg cccgccccac ct 



<210> 5 

<211> 1720 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> CDS 

<222> (197) . . (619) 



<400> 5 

atgatcgcct tcggaaccgg cccggaattc ccgggtcgac ccacgcgtcc gctctttggc 60 

actagttcag aatggtgatg tgtcggcccc ctctgccata ctcagaacac cagaaagcac 120 

aaaaccgggt cctgtttgtc agccaccagt gagtcagagc cgctccctgt tttcttctgt 180 

cccgtccaag ccacca atg tct ctg gag cct caa aat ggg acg tat gca 

Met Ser Leu Glu Pro Gin Asn Gly Thr Tyr Ala 
15 10 

gga cca gcg cca gca ttc cag cca ttt ttc ttc act gga gca ttt cca 
Gly Pro Ala Pro Ala Phe Gin Pro Phe Phe Phe Thr Gly Ala Phe Pro 
15 20 25 

ttt aat atg caa gag ctg gta etc aag gtg aga att cag aac cca tct 
Phe Asn Met Gin Glu Leu Val Leu Lys Val Arg He Gin Asn Pro Ser 
30 35 40 



acc tac caa gag ttg etc aaa gtg tgt tgc tgt gag ctg ggt gtt aat 
Thr Tyr Gin Glu Leu Leu Lys Val Cys Cys Cys Glu Leu Gly Val Asn 
60 65 70 75 

cca gat caa gtg gag aag ate aga aag tta ccc aat act ctg tta agg 
Pro Asp Gin Val Glu Lys He Arg Lys Leu Pro Asn Thr Leu Leu Arg 
80 85 90 



229 



277 



325 



ctt cga gaa aat gat ttc att gaa att gaa ctg gac cga cag gag etc 373 
Leu Arg Glu Asn Asp Phe He Glu He Glu Leu Asp Arg Gin Glu Leu 
45 50 55 



421 



469 



aag gac aag gat gtt get cga etc caa gat ttc cag gag ctg gaa ctg 517 

Lys Asp Lys Asp Val Ala Arg Leu Gin Asp Phe Gin Glu Leu Glu Leu 
95 100 105 

gtt ctg atg ata agt gaa aat aat ttt ctg ttc aga aat get gca tec 5 65 
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Val Leu Met lie Ser Glu Asn Asn Phe Leu Phe Arg Asn Ala Ala Ser 
110 115 120 

aca ctg act gaa agg cct tgc tat aac agg aga get tea aaa ctg act 613 
Thr Leu Thr Glu Arg Pro Cys Tyr Asn Arg Arg Ala Ser Lys Leu Thr 
125 130 135 

tac taa tgcagcaggg acttttatca ctgagtatta tgacagtgtg catcacctct 669 

Tyr * 

140 

729 
789 
849 
909 
969 
1029 
1089 
1149 
1209 
1269 
1329 
1389 
1449 
1509 
1569 
1629 
1689 
1720 



gggccaagga 


caagecattg 


atetaaatge 


ctcagatgcc 


egggagggee 


tctggtgcca 


ctgcatagta 


tatactaaca 


tcattctgcc 


aaggtaggaa 


gcccctgacc 


cccaagcagt 


ggtgccactc 


ttccaagcct 


cttggtgcac 


agtaaaccta 


ttgcttgaag 


ctttgaacag 


ctgagaagtg 


gtctggagag 


gcagaagctg 


aaggttctat 


atccagtgtg 


ttttatgtcc 


agaatgtaag 


agagttgtct 


aagcagcagc 


tgagagagag 


eggagectat 


ttctagccac 


tcctgttgac 


agtgcacctg 


a-agggctggg 


atgcgttttt 


cttggtgttg 


catgctcaca 


actctgetga 


cattgggaac 


ttatgagaga 


ggaagactcg 


ggaaagcaca 


gatactggac 


agatggattc 


tggtgtgggg 


aaagcacaga 


tactggacag 


atggattcta 


gtgtgacttg 


tgactgtgag 


gtttcctata 


acatatttat 


aaatgttact 


caggttaaaa 


gtatttaaga 


atacagttaa 


ctaattgtaa 


atatgctgtt 


aaccaaaaga 


gctttccctc 


cctcactttt 


tcctttgtaa 


acactcatga 


ctgcttctct 


gtctcgagtc 


atetctgeat 


taactcccct 


tcgtggtcac 


tagagggctc 


tetgatgect 


tctaaaagaa 


caactgettt 


tttacaatgc 


cccccacccc 


caccccgccc 


atagagacag 


ggtctcacta 


tgtggcccag 


gctggtctca 


aacttctggc 


cttaagtgat 


gctcctgtcc 


ttggcctccc 


aaagtgctgg 


gattacaggt 


gtgagecact 


gcacccagcc 


cacttttttt 


ttactctgaa 


gtgattccag 


tcatatgtgt 


gtgtaactgc 


atcctatgag 


aagagcacaa 


atattgetgt 


tccatgttct 


ccactttcat 


tttccactac 


aaatgaaaag 


caatttttga 


gactgaatct 


gttgctattt 


taaaggttat 


tgtgggaaac 


tgagctaaag 


gagttagcat 


c 







<210> 6 

<211> 1658 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> CDS 
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<222> (523) . . (1479) 



^ a r\ r\ ^ 
<4UU> 


c 
O 










60 


ctagtacgcc 


tgtggaacgc 


ctgcaggtac 


cggtccggaa 


ttcccgggtc 


gacccacgcg 


tccgcccacg 


cgtccgctct 


gtctcaaata 


ataataataa 


taataataat 


aataataata 


120 


ataataataa 


tgtaggggac 


ttgatgaagg 


gaaaggatca 


gatagattct 


gaaaagaagt 


180 


acagaacctg 


ctaaggccat 


catacctatt 


gatcggaagt 


cagtccatca 


gatttgetet 


240 


ggaccggtgg 


tactgagtct 


aatcactgcg 


tgtaagaaga 


tagtaggaaa 


cattctggat 


300 


gctaggtgcc 


agtaatattg 


atctaaagct 


taaggactat 


ggaatggatc 


tcattgaagt 


360 


ttcaggcaat 


ggatgtgggg 


tagaagaaga 


aaacttcgaa ggcttaatct 


ctttcagctc 


420 


tgaaacatca 


cacatctaag 


attcaagagt 


ttgccgacct 


aactcgggtt 


gaaacttttg 


480 


gctttcgggg 


gaaagctctg 


agctcacttt 


gtgcactgag 


tg atg tea cca ttt 


534 








Met Ser Pro Phe 





eta cct gec acg tat egg cga agg ttg gga etc gac tgg tgt ttg ate 
Leu Pro Ala Thr Tyr Arg Arg Arg Leu Gly Leu Asp Trp Cys Leu He 
5 10 15 20 

acg atg gga aaa tea tec aga aaa ccc cct acc ccc ace cca gag gga 
Thr Met Gly Lys Ser Ser Arg Lys Pro Pro Thr Pro Thr Pro Glu Gly 
25 30 35 

ccc aca gtc age gtg aag cag tta ttt tct acg eta cct gtg cgc cat 
Pro Thr Val Ser Val Lys Gin Leu Phe Ser Thr Leu Pro Val Arg His 
40 45 50 

aag gaa ttt caa agg aat att aag aag aaa cgt gee tgc ttc ccc ttc 
Lys Glu Phe Gin Arg Asn He Lys Lys Lys Arg Ala Cys Phe Pro Phe 
55 60 65 

gee ttc tgc cgt gat tgt cag ttt ctt gag ggc tec cca gee atg ctt 
Ala Phe Cys Arg Asp Cys Gin Phe Leu Glu Gly Ser Pro Ala Met Leu 
70 75 80 

cct gta cag cct gca aaa ctt aca gaa cct get aag gec ate aaa cct 
Pro Val Gin Pro Ala Lys Leu Thr Glu Pro Ala Lys Ala He Lys Pro 
85 90 95 100 

att gat egg aag tea gtc cat cag att tgc tct ggg ccg gtg gta ctg 
He Asp Arg Lys Ser Val His Gin He Cys Ser Gly Pro Val Val Leu 
105 HO 115 

agt eta age act gcg gtg aag aag ata gta gga aac agt ctg gat get 
Ser Leu Ser Thr Ala Val Lys Lys He Val Gly Asn Ser Leu Asp Ala 
120 125 130 

ggt gec act aat att gat eta aag ctt aag gac tat gga atg gat etc 
Gly Ala Thr Asn He Asp Leu Lys Leu Lys Asp Tyr Gly Met Asp Leu 



582 



630 



678 



726 



774 



822 



870 



918 



966 
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135 140 145 

att gaa gtt tea ggc aat gga tgt ggg gta gaa gaa gaa aac ttc gaa 1014 
lie Glu Val Ser Gly Asn Gly Cys Gly Val Glu Glu Glu Asn Phe Glu 
150 155 160 



ggc tta tct ctt tea get ctg aaa cat cac aca tct aag att cga gag 
Gly Leu Ser Leu Ser Ala Leu Lys His His Thr Ser Lys He Arg Glu 
165 170 175 180 



agg aat att aag aag aaa cgt gec tgc ttc ccc ttc gec ttc tgc cgt 
Arg Asn He Lys Lys Lys Arg Ala Cys Phe Pro Phe Ala Phe Cys Arg 
265 270 275 

gat tgt cag ttt ctt gag ggc tec cca gee atg ctt cct gta cag cct 
Asp Cys Gin Phe Leu Glu Gly Ser Pro Ala Met Leu Pro Val Gin Pro 
280 285 290 

gca aaa ctg act gta act gga gag eta egg gca tgc aga agt tgg aag 
Ala Lys Leu Thr Val Thr Gly Glu Leu Arg Ala Cys Arg Ser Trp Lys 
295 300 305 



1062 



ttt gec gac eta act egg gtt gaa act ttt ggc ttt cag ggg aaa get 1110 
Phe Ala Asp Leu Thr Arg Val Glu Thr Phe Gly Phe Gin Gly Lys Ala 
185 190 195 

ctg age tea ctt tgt gca ctg agt gat gtc acc att tct ace tgc cac 1158 
Leu Ser Ser Leu Cys Ala Leu Ser Asp Val Thr He Ser Thr Cys His 
200 205 210 

gta teg gcg aag gtt ggg act cga ctg gtg ttt gat cac gat ggg aaa 1206 
Val Ser Ala Lys Val Gly Thr Arg Leu Val Phe Asp His Asp Gly Lys 
215 220 225 

ate ate aag aaa acc ccc tac ccc cac ccc aga ggg acc aca gtc age 1254 
He He Lys Lys Thr Pro Tyr Pro His Pro Arg Gly Thr Thr Val Ser 
230 235 240 

gtg aag cag tta ttt tct acg eta cct gtg cgc cat aag gaa ttt caa 13 02 

Val Lys Gin Leu Phe Ser Thr Leu Pro Val Arg His Lys Glu Phe Gin 
245 250 255 260 



1350 



1398 



1446 



acg agg gaa ggc ate aca gag get gtg ggg tga accgactt caaggaatgg 1497 
Thr Arg Glu Gly He Thr Glu Ala Val Gly * 
310 315 

gtccttccct tcagaaccac atgtgtgcgg gacacccaga cagaaaacac aaatgcaaag 1557 

tcaagtggag ggcatttgga aggagcagtg aagccaagcc aggaaacacc aagatggcga 1617 

gccagtgtgg ttgtagagat tgcagagagg gacaagataa g 165 8 



<210> 7 
<211> 3739 
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<212> DNA 

<213> Homo sapiens 



<220> 
<221> CDS 

<222> (460) . , (3621) 
<400> 7 

gtctgctgta ataccctcta ctatagggac cactttgtac aagaaagctg ggtacgcgta 60 

agcttgggcc cctcgaggga ttctctagag cgtccgcggg ggctgccagg gtatttcggg 12 0 

aagggggcgt gaggaggcgg cggcggcagc ggcgggtagg gcaggcagca gagggaagga 18 0 

gaaagaaagg aaggaagagg gcggggagtc ctcagaggag gaggcgggac cggccgggca 240 

cctcacctct cggcggcggc ggcggcggcg gcggcggcgg cgggcggccg gggaggcggc 300 

gccgccccct gcccgcggtc ttctcagcgc agtcgggcgc ggacccgctg gtcccgggca 3 60 

gcggccaagg ctactggggc gggagcagtg ggccggtcgg cggcggcagc ggcagcggcg 42 0 

gaggaggagg aggctggagt gggcgcggag gcgaccgcc atg gcg ttc etc aaa 474 

Met Ala Phe Leu Lys 
1 5 

etc cgt gac cag cca tea ctg gtg caa get ata ttt aac gga gat cct 522 
Leu Arg Asp Gin Pro Ser Leu Val Gin Ala lie Phe Asn Gly Asp Pro 
10 15 20 

gat gaa gtt cga gca eta ata ttt aag aaa gaa gat gtt aac ttt cag 570 
Asp Glu Val Arg Ala Leu lie Phe Lys Lys Glu Asp Val Asn Phe Gin 
25 30 35 

gac aat gaa aag cga acc cca ttg cac gee gca get tac ctt gga gat 618 
Asp Asn Glu Lys Arg Thr Pro Leu His Ala Ala Ala Tyr Leu Gly Asp 
40 45 50 

gca gaa ate att gaa ctt ctt att tta tct gga get aga gtt aat gee 666 
Ala Glu lie lie Glu Leu Leu lie Leu Ser Gly Ala Arg Val Asn Ala 
55 60 65 

aaa gac age aaa tgg ttg aca cct tta cac aga gca gtt gca tct tgt 714 
Lys Asp Ser Lys Trp Leu Thr Pro Leu His Arg Ala Val Ala Ser Cys 
70 75 80 85 

agt gag gaa gca gtt cag gta ctt ttg aag cat tct gca gat gtt aat 762 
Ser Glu Glu Ala Val Gin Val Leu Leu Lys His Ser Ala Asp Val Asn 
90 95 100 

get cga gac aaa aat tgg caa acc cct tta cat ata get get get aat 810 
Ala Arg Asp Lys Asn Trp Gin Thr Pro Leu His lie Ala Ala Ala Asn 
105 110 115 

aaa get gta aag tgt get gaa get ttg gta cct ctt ctg agt aat gta 858 
Lys Ala Val Lys Cys Ala Glu Ala Leu Val Pro Leu Leu Ser Asn Val 
120 125 130 
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aac gta tct gat cga gca ggg agg act gca tta cat cat gca get ttc 906 
Asn Val Ser Asp Arg Ala Gly Arg Thr Ala Leu His His Ala Ala Phe 
135 140 145 



954 



1002 



agt gga cat ggt gag atg gtc aaa eta etc ttg tct aga ggt gec aat 
Ser Gly His Gly Glu Met Val Lys Leu Leu Leu Ser Arg Gly Ala Asn 
150 155 160 165 

att aat get ttt gac aag aaa gat agg cgt get ate cat tgg gca gca 
lie Asn Ala Phe Asp Lys Lys Asp Arg Arg Ala lie His Trp Ala Ala 
170 175 180 

tat atg ggt cac att gaa gta gtg aaa ttg ctt gtg teg cat gga get 1050 
Tyr Met Gly His He Glu Val Val Lys Leu Leu Val Ser His Gly Ala 
185 190 195 

gaa gtg aca tgc aag gat aaa aag tct tat aca cct ctt cat gca gca 1098 
Glu Val Thr Cys Lys Asp Lys Lys Ser Tyr Thr Pro Leu His Ala Ala 
200 205 210 

gee tct agt gga atg ate age gta gtc aag tac ctt eta gat ctt gga 1146 
Ala Ser Ser Gly Met He Ser Val Val Lys Tyr Leu Leu Asp Leu Gly 
215 220 225 

gtt gat atg aat gaa cca aat gee tat gga aat aca cct ctt cat gta 1194 
Val Asp Met Asn Glu Pro Asn Ala Tyr Gly Asn Thr Pro Leu His Val 
230 235 240 245 

gee tgc tat aat gga caa gat gtt gta gtg aat gaa ctt ata gac tgt 1242 
Ala Cys Tyr Asn Gly Gin Asp Val Val Val Asn Glu Leu He Asp Cys 
250 255 260 

ggt get att gtg aat caa aag aat gaa aaa gga ttt act cct ttg cac 12 90 

Gly Ala He Val Asn Gin Lys Asn Glu Lys Gly Phe Thr Pro Leu His 
265 270 275 

ttt get get gca tea aca cat gga gca ttg tgt tta gag ctt eta gtt 
Phe Ala Ala Ala Ser Thr His Gly Ala Leu Cys Leu Glu Leu Leu Val 
280 285 290 

ggc aat ggg gee gat gtc aat atg aag agt aaa gat ggg aaa ace cca 
Gly Asn Gly Ala Asp Val Asn Met Lys Ser Lys Asp Gly Lys Thr Pro 
295 300 305 

eta cac atg act get etc cac ggt aga ttc tec cga tea caa ace att 1434 
Leu His Met Thr Ala Leu His Gly Arg Phe Ser Arg Ser Gin Thr He 
310 315 320 325 

ate cag agt gga get gta ate gac tgt gag gat aag aat gga aat acc 1482 
He Gin Ser Gly Ala Val He Asp Cys Glu Asp Lys Asn Gly Asn Thr 
330 335 340 

cct ttg cac ata gca gca egg tat ggc cat gag ctg ctg ate aac act 153 0 

Pro Leu His He Ala Ala Arg Tyr Gly His Glu Leu Leu He Asn Thr 
345 350 355 



1338 



1386 



25 



1722 



1770 



1818 



ctt att aca agt ggt get gac act gca aag cgt ggc ata cat gga atg 1578 
Leu He Thr Ser Gly Ala Asp Thr Ala Lys Arg Gly He His Gly Met 
360 365 370 

ttc ccc etc cat ttg gca gec tta age ggc ttt tea gat tgc tgc aga 1626 
Phe Pro Leu His Leu Ala Ala Leu Ser Gly Phe Ser Asp Cys Cys Arg 
375 380 385 

aaa ctt ctt tct tea gga ttt gat ata gat acc cca gat gat ttt ggc 1674 
Lys Leu Leu Ser Ser Gly Phe Asp He Asp Thr Pro Asp Asp Phe Gly 
390 395 400 405 

agg act tgt eta cat gca get gca get gga ggg aat ttg gag tgc eta 
Arg Thr Cys Leu His Ala Ala Ala Ala Gly Gly Asn Leu Glu Cys Leu 
410 415 420 

aac ctt ctg ctg aat act ggt gca gac ttt aat aaa aag gac aaa ttt 
Asn Leu Leu Leu Asn Thr Gly Ala Asp Phe Asn Lys Lys Asp Lys Phe 
425 430 435 

ggg aga tct cca ctg cac tac get get gec aac tgc aat tac cag tgc 
Gly Arg Ser Pro Leu His Tyr Ala Ala Ala Asn Cys Asn Tyr Gin Cys 
440 445 450 

ctg ttt get ctt gtg gga tea gga gca agt gtg aat gac ctt gat gaa 1866 
Leu Phe Ala Leu Val Gly Ser Gly Ala Ser Val Asn Asp Leu Asp Glu 
455 460 465 

aga ggc tgc aca ccc ctg cac tat gca get aca tea gac aca gat ggc 1914 
Arg Gly Cys Thr Pro Leu His Tyr Ala Ala Thr Ser Asp Thr Asp Gly 
470 475 480 485 

aag tgc ctg gaa tac tta tta aga aac gat gca aat cca ggg ate cgt 
Lys Cys Leu Glu Tyr Leu Leu Arg Asn Asp Ala Asn Pro Gly He Arg 
490 495 500 

gat aag caa gga tac aac gca gtt cat tat tea get get tat ggt cac 
Asp Lys Gin Gly Tyr Asn Ala Val His Tyr Ser Ala Ala Tyr Gly His 
505 510 515 

cgt eta tgt ctt cag ctg att gca agt gaa act cct eta gat gtt tta 
Arg Leu Cys Leu Gin Leu He Ala Ser Glu Thr Pro Leu Asp Val Leu 
520 525 530 

atg gaa acc tea gga aca gac atg ctg agt gat tea gat aat aga gca 
Met Glu Thr Ser Gly Thr Asp Met Leu Ser Asp Ser Asp Asn Arg Ala 
535 540 545 

aca ata age cct tta cac ttg get gec tat cat ggt cac cat caa gca 
Thr He Ser Pro Leu His Leu Ala Ala Tyr His Gly His His Gin Ala 
550 555 560 565 

ctg gaa gtg ttg gta cag tct ttg tta gat ctt gat gtc aga aat agt 
Leu Glu Val Leu Val Gin Ser Leu Leu Asp Leu Asp Val Arg Asn Ser 
570 575 580 

agt gga aga aca ccc eta gat ctt gca get ttt aag ggc cat gtt gaa 



1962 



2010 



2058 



2106 



2154 



2202 



2250 



26 



2298 



2346 



2394 



2442 



2490 



Ser Gly Arg Thr Pro Leu Asp Leu Ala Ala Phe Lys Gly His Val Glu 
585 590 595 

tgt gtg gat gta etc att aat cag gga gec tea ate tta gta aaa gat 
Cys Val Asp Val Leu lie Asn Gin Gly Ala Ser lie Leu Val Lys Asp 
600 605 610 

tac att ttg aag agg aca cct att cat gca gca gca aca aat ggt cat 
Tyr He Leu Lys Arg Thr Pro He His Ala Ala Ala Thr Asn Gly His 
615 620 625 

tea gaa tgc tta egg eta tta ata gga aat gca gaa cca cag aat gca 
Ser Glu Cys Leu Arg Leu Leu He Gly Asn Ala Glu Pro Gin Asn Ala 
630 635 640 645 

gtg gat att caa gat gga aat gga cag acg cct ctg atg eta tct gtt 
Val Asp He Gin Asp Gly Asn Gly Gin Thr Pro Leu Met Leu Ser Val 
650 655 660 

etc aac ggg cac aca gac tgt gtt tac tea ttg ctg aac aaa gga gca 
Leu Asn Gly His Thr Asp Cys Val Tyr Ser Leu Leu Asn Lys Gly Ala 
665 670 675 

aat gta gat gee aaa gat aag tgg gga agg aca gcg ttg cat aga ggg 2 53 8 

Asn Val Asp Ala Lys Asp Lys Trp Gly Arg Thr Ala Leu His Arg Gly 
680 685 690 

gca gtt aca ggc cat gaa gaa tgt gta gat gca tta ctt caa cat ggt 2586 
Ala Val Thr Gly His Glu Glu Cys Val Asp Ala Leu Leu Gin His Gly 
695 700 705 

get aag tgc tta ctt egg gat age agg ggc egg acg cct ata cac ctg 2634 
Ala Lys Cys Leu Leu Arg Asp Ser Arg Gly Arg Thr Pro He His Leu 
710 715 720 725 

tct get gec tgt gga cac att ggt gtt ctt gga gee ctt ttg cag tea 2682 
Ser Ala Ala Cys Gly His He Gly Val Leu Gly Ala Leu Leu Gin Ser 
730 735 740 

gca gca tct atg gat gca aat cca gee aca gca gac aat cat gga tat 2730 
Ala Ala Ser Met Asp Ala Asn Pro Ala Thr Ala Asp Asn His Gly Tyr 
745 750 755 

acg gca ctt cac tgg get tgc tac aat ggt cac gag aca tgt gta gaa 2 77 8 
Thr Ala Leu His Trp Ala Cys Tyr Asn Gly His Glu Thr Cys Val Glu 
760 765 770 

ctg ctt tta gaa cag gaa gtt ttc cag aaa acg gaa gga aat get ttt 2826 
Leu Leu Leu Glu Gin Glu Val Phe Gin Lys Thr Glu Gly Asn Ala Phe 
775 780 785 

agt cca ttg cat tgt gee gtg ata aat gac aac gaa ggt get get gag 2 874 

Ser Pro Leu His Cys Ala Val He Asn Asp Asn Glu Gly Ala Ala Glu 
790 795 800 805 



atg tta att gat aca tta ggt gec age att gtg aac gec aca gat tea 
Met Leu He Asp Thr Leu Gly Ala Ser He Val Asn Ala Thr Asp Ser 



2922 



27 



810 



815 



820 



aaa gga aga act cct etc cat gca gec gec ttc aca gac cat gta gag 
Lys Gly Arg Thr Pro Leu His Ala Ala Ala Phe Thr Asp His Val Glu 
825 830 835 



2970 



tgt tta cag ctg ctg etc age cat aat get caa gtc aat tct gtg gac 3 018 

Cys Leu Gin Leu Leu Leu Ser His Asn Ala Gin Val Asn Ser Val Asp 
840 845 850 

tct aca ggg aaa aca cct ctt atg atg get gca gaa aat gga caa aca 3066 
Ser Thr Gly Lys Thr Pro Leu Met Met Ala Ala Glu Asn Gly Gin Thr 
855 860 865 

aat aca gtt gag atg ctg gtt age agt get agt gca gaa ctg act tta 3114 
Asn Thr Val Glu Met Leu Val Ser Ser Ala Ser Ala Glu Leu Thr Leu 
870 875 880 885 

caa gat aac agt aaa aat act gec etc cat ttg get tgt age aag ggt 3162 
Gin Asp Asn Ser Lys Asn Thr Ala Leu His Leu Ala Cys Ser Lys Gly 
890 895 900 



3210 



3258 



cat gaa act agt gee ttg tta ata ctg gaa aag ata aca gat aga aac 
His Glu Thr Ser Ala Leu Leu He Leu Glu Lys He Thr Asp Arg Asn 
905 910 915 

etc ate aat gca ace aac gca gee ttg caa aca cct ctg cat gtt get 
Leu He Asn Ala Thr Asn Ala Ala Leu Gin Thr Pro Leu His Val Ala 
920 925 930 

gee cga aat ggg eta aca atg gtg gtt cag gaa ctt ttg gga aaa gga 33 06 

Ala Arg Asn Gly Leu Thr Met Val Val Gin Glu Leu Leu Gly Lys Gly 
935 940 945 

gca agt gtg ctt gca gta gat gaa aat ggc tat ace cca get ttg gee 3354 
Ala Ser Val Leu Ala Val Asp Glu Asn Gly Tyr Thr Pro Ala Leu Ala 
950 955 960 965 

tgt get ccc aat aag gat gtg get gat tgc ctg get etc att ttg gee 34 02 

Cys Ala Pro Asn Lys Asp Val Ala Asp Cys Leu Ala Leu He Leu Ala 
970 975 980 

acc atg atg cct gtc tea tea agt agt cct tta tea tec tta aca ttc 3450 
Thr Met Met Pro Val Ser Ser Ser Ser Pro Leu Ser Ser Leu Thr Phe 
985 990 995 

aat gee att aac cgt tat acc aac acc tea aaa aca gtc age ttt gaa 3498 
Asn Ala He Asn Arg Tyr Thr Asn Thr Ser Lys Thr Val Ser Phe Glu 
1000 1005 1010 

get ttg ccc ate atg agg aat gaa cct age tec tat tgc agt ttc aat 3546 
Ala Leu Pro He Met Arg Asn Glu Pro Ser Ser Tyr Cys Ser Phe Asn 
1015 1020 1025 

aac att gga ggg gaa cag gag tac tta tac act gac gtg gat gag etc 3 594 
Asn He Gly Gly Glu Gin Glu Tyr Leu Tyr Thr Asp Val Asp Glu Leu 
1030 1035 1040 1045 



28 



aac gac tec gat tct gag acc tac tga gaggc tgaggaggag ggagttctca 3 646 
Asn Asp Ser Asp Ser Glu Thr Tyr * 
1050 

cagtaaagct tcaaactgtg ctttttcagg aaacaggcac tttgatattc aegtagaaat 3706 

tcaacctaag agggacagtt ccgttgaagc cgt 373 9 



<210> 8 

<211> 2162 

<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 

<222> (387) . . (1748) 



<400> 
cgtactgaag 


8 

tggtctgeat 


gagegacegg 


teeggaatte 


ccgggtcgac 


gatttegtte 


eggttgeact 


cttcctatag 


cccagagggc 


gagagggect 


gtggcctggg 


ggaaggagga 


cgaggttctg 


cctggatccc 


agcagtagga 


cgctgtgcca 


tttgggaaca 


aaggaatagt 


ctgcctggaa 


tccctgcaga 


tettggggee 


ggaggccagt 


ccaacccttg 


gagcaggaag 


aaacgcaaag 


ttgtcaagaa 


ccaagtcgag 


ctgcctcaga 


gccggcccgc 


agtagctgea 


gactccgccc 


gcgacgtgtg 


cgcgcttctc 


tgggecagag 


cgagcctgtt 


ttgtgctcgg 


gttaagagat 


ttgtcccagc 


tatacc atg 


ggc cgc act egg gaa get ggc tgc 



Met Gly Arg Thr Arg Glu Ala Gly Cys 
1 5 



60 
12 0 
180 
240 
300 
360 
413 



gtg gec get ggt gtg gtt ate ggg get ggt gee tgc tac tgt gta tac 461 
Val Ala Ala Gly Val Val He Gly Ala Gly Ala Cys Tyr Cys Val Tyr 
10 15 20 25 

aga ctg get tgg gga aga gac gag aac gag aaa ate tgg gac gaa gac 509 
Arg Leu Ala Trp Gly Arg Asp Glu Asn Glu Lys He Trp Asp Glu Asp 
30 35 40 

gag gag tct acg gac acc tea gag att ggg gtt gag act gtg aaa gga 557 
Glu Glu Ser Thr Asp Thr Ser Glu He Gly Val Glu Thr Val Lys Gly 
45 50 55 

get aaa act aac get ggg gca ggg tct ggg gec aaa ctt cag ggt gat 605 
Ala Lys Thr Asn Ala Gly Ala Gly Ser Gly Ala Lys Leu Gin Gly Asp 
60 65 70 

tea gag gtc aag cct gag gtg agt ttg gga etc gag gat tgt ccg ggt 653 
Ser Glu Val Lys Pro Glu Val Ser Leu Gly Leu Glu Asp Cys Pro Gly 
75 80 85 



29 



gta aaa gag aag gcc cat tea gga tec cac age gga ggt ggt eta gag 
Val Lys Glu Lys Ala His Ser Gly Ser His Ser Gly Gly Gly Leu Glu 
90 95 100 105 



701 



gcc aag gcc aag gcc ctt ttc aac acg ctg aag gaa cag gca agt gca 749 
Ala Lys Ala Lys Ala Leu Phe Asn Thr Leu Lys Glu Gin Ala Ser Ala 
110 115 120 

aag gca ggc aaa ggg get agg gtg ggt acc ate tct ggg aac agg acc 7 97 

Lys Ala Gly Lys Gly Ala Arg Val Gly Thr He Ser Gly Asn Arg Thr 
125 130 135 

ctt gca ccg agt tta ccc tgc cca gga ggc agg ggt gga ggc tgc cac 845 
Leu Ala Pro Ser Leu Pro Cys Pro Gly Gly Arg Gly Gly Gly Cys His 
140 145 150 

ccc acc agg agt gga tct agg gcc ggg ggc agg gca agt gga aaa tec 8 93 

Pro Thr Arg Ser Gly Ser Arg Ala Gly Gly Arg Ala Ser Gly Lys Ser 
155 160 165 

aag gga aag gcc cga agt aag age acc agg get cca get aca aca tgg 941 
Lys Gly Lys Ala Arg Ser Lys Ser Thr Arg Ala Pro Ala Thr Thr Trp 
170 175 180 185 

cct gtc egg aga ggc aag ttc aac ttt cct tat aaa att gat gat att 989 
Pro Val Arg Arg Gly Lys Phe Asn Phe Pro Tyr Lys He Asp Asp He 
190 195 200 

ctg agt get ccc gac etc caa aag gtc etc aac ate ctg gag cga aca 1037 
Leu Ser Ala Pro Asp Leu Gin Lys Val Leu Asn He Leu Glu Arg Thr 
205 210 215 

aat gat cct ttt att caa gaa gta gcc ttg gtc act ctg ggt aac aat 1085 
Asn Asp Pro Phe He Gin Glu Val Ala Leu Val Thr Leu Gly Asn Asn 
220 225 230 

gca gca tat tea ttt aac cag aat gcc ata cgt gaa ttg ggt ggt gtc 1133 
Ala Ala Tyr Ser Phe Asn Gin Asn Ala He Arg Glu Leu Gly Gly Val 
235 240 245 

cca att att gca aaa ctg ata aaa aca aaa gac ccc ata att agg gaa 1181 
Pro He He Ala Lys Leu He Lys Thr Lys Asp Pro He He Arg Glu 
250 255 260 265 

aag act tac aat gcc ctt aat aac ttg agt gtg aac gca gaa aat cag 
Lys Thr Tyr Asn Ala Leu Asn Asn Leu Ser Val Asn Ala Glu Asn Gin 
270 275 280 

ggc aag att aag acg tac ate agt caa gtg tgt gat gac acc atg gtc 1277 
Gly Lys He Lys Thr Tyr He Ser Gin Val Cys Asp Asp Thr Met Val 
285 290 295 

tgt cgc ttg gac tea get gtg cag atg get ggg eta aga ctg tta acc 1325 
Cys Arg Leu Asp Ser Ala Val Gin Met Ala Gly Leu Arg Leu Leu Thr 
300 305 310 



1229 



30 



aac atg act gtg act aat cat tac caa cat ttg ctt tec tat tct ttt 1373 
Asn Met Thr Val Thr Asn His Tyr Gin His Leu Leu Ser Tyr Ser Phe 
315 320 325 

cca gac ttt ttt get ttg tta ttc ctg gga aat cac ttc acc aag ata 1421 
Pro Asp Phe Phe Ala Leu Leu Phe Leu Gly Asn His Phe Thr Lys lie 
330 335 340 345 

cag att atg aaa eta att ata aac ttt act gaa aat cca gec atg aca 1469 
Gin lie Met Lys Leu lie lie Asn Phe Thr Glu Asn Pro Ala Met Thr 
350 355 360 

aga gag ctg gtc agt tgt aaa gta cca tea gaa ttg att tec etc ttt 1517 
Arg Glu Leu Val Ser Cys Lys Val Pro Ser Glu Leu lie Ser Leu Phe 
365 370 375 

aat aaa gaa tgg gat aga gag att ctt ctt aat ate ctt acc eta ttt 1565 
Asn Lys Glu Trp Asp Arg Glu lie Leu Leu Asn He Leu Thr Leu Phe 
380 385 390 

gag aat ata aat gac aac ata aaa aat gaa ggg etc gca tea tec agg 1613 
Glu Asn He Asn Asp Asn He Lys Asn Glu Gly Leu Ala Ser Ser Arg 
395 400 405 

aaa gaa ttc age aga agt tea ctt ttt ttc tta ttc aaa gag tct gga 1661 
Lys Glu Phe Ser Arg Ser Ser Leu Phe Phe Leu Phe Lys Glu Ser Gly 
410 415 420 425 

gtt tgt gtt aag aaa ate aaa gca eta gca aat cac aat gat ctg gtg 1709 
Val Cys Val Lys Lys He Lys Ala Leu Ala Asn His Asn Asp Leu Val 
430 435 440 

gtg aaa gta aaa gtc ctg aaa gta tta acc aaa etc taa tttggagtct 1758 
Val Lys Val Lys Val Leu Lys Val Leu Thr Lys Leu * 
445 450 

gtcccaaaca atattgagat atttgeagtt ggtacgatgt gatttgtaaa ttctttgttt 1818 

ttcattgtgc gtatatggta aagagatctt ttcagctget attttggaat aatgactatc 1878 

atatatcata acagtgactg atgttggttg taatggttgg gtttaggatg aaccatttta 193 8 

aggatgecaa atgaaatatt agtatttgta cacagaaaga atttattgat ttgatcttat 1998 

tacctagatt gagatttttt aatctttcct ctacctaaac tgacaatgaa ttggttatac 2058 

ateatgeata agctacactt ttatattagt ttatatttgt tattctaaga cttgtgtttc 2118 

atcaataaag ttgtgtttta agcagcagaa aaaaaaaaaa aaaa 2162 



<210> 9 

<211> 1614 

<212> DNA 

<213> Homo sapiens 
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<220> 

<221> CDS 

<222> (202) . . (969) 

<400> 9 

aagctggtac gcctgcaggt accggtccgg aattcccggg tcgacgattt cgtcaaagag 60 

acaaactcca ttttcttatg aatggaaagt gaaaacccct gttccgctta aattgggttc 12 0 

cttcctgtcc tgagaaacat agagaccccc aaaagggaag cagaggagag aaagtcccac 180 

acccagaccc cgcgagaaga g atg acc atg acc acc atg cca gaa agt etc 231 

Met Thr Met Thr Thr Met Pro Glu Ser Leu 
15 10 

aac age ccc gtg teg ggc aag gcg gtg ttt atg gag ttt ggg ccg ccc 279 
Asn Ser Pro Val Ser Gly Lys Ala Val Phe Met Glu Phe Gly Pro Pro 
15 20 25 

aac cag caa atg tct cct tec ccc atg tec cac ggg cac tac tec atg 327 
Asn Gin Gin Met Ser Pro Ser Pro Met Ser His Gly His Tyr Ser Met 
30 35 40 

cac tgt tta cac teg gcg ggc cat teg cag ccc gac ggc gec tac age 375 
His Cys Leu His Ser Ala Gly His Ser Gin Pro Asp Gly Ala Tyr Ser 
45 50 55 

tea gee teg tec ttc tec cga ccg ctg ggc tac ccc tac gtc aac teg 423 
Ser Ala Ser Ser Phe Ser Arg Pro Leu Gly Tyr Pro Tyr Val Asn Ser 
60 65 70 

gtc age age cac gca tec age ccc tac ate agt teg gtg cag tec tac 471 
Val Ser Ser His Ala Ser Ser Pro Tyr lie Ser Ser Val Gin Ser Tyr 
75 80 85 90 

ccg ggc age gec age etc gec cag age cgc ctg gag gac cca ggg gcg 519 
Pro Gly Ser Ala Ser Leu Ala Gin Ser Arg Leu Glu Asp Pro Gly Ala 
95 100 105 

gac teg gag aag age acg gtg gtg gaa ggc ggt gaa gtg cgc ttc aat 567 
Asp Ser Glu Lys Ser Thr Val Val Glu Gly Gly Glu Val Arg Phe Asn 
110 115 120 

ggc aag gga aaa aag ate cgt aaa ccc agg acg att tat tec agt ttg 615 
Gly Lys Gly Lys Lys lie Arg Lys Pro Arg Thr lie Tyr Ser Ser Leu 
125 130 135 

cag ttg cag get ttg aac egg agg ttc cag caa act cag tac eta get 663 
Gin Leu Gin Ala Leu Asn Arg Arg Phe Gin Gin Thr Gin Tyr Leu Ala 
140 145 150 

ctg ccg gag agg gcg gag etc gcg gee tct ttg gga etc aca cag act 711 
Leu Pro Glu Arg Ala Glu Leu Ala Ala Ser Leu Gly Leu Thr Gin Thr 
155 160 165 170 

cag gtc aag ate tgg ttc caa aac aag cga tec aag ttc aag aag ctg 75 9 



32 



Gin Val Lys He Trp Phe Gin Asn Lys Arg Ser Lys Phe Lys Lys Leu 
175 180 185 

atg aag cag ggt ggg gcg get ctg gag ggt agt gcg ttg gec aac ggg 
Met Lys Gin Gly Gly Ala Ala Leu Glu Gly Ser Ala Leu Ala Asn Gly 
190 195 200 



aac tct tea tec ggg aag ggc tea gga gga aac gcg ggc tec tat ate 
Asn Ser Ser Ser Gly Lys Gly Ser Gly Gly Asn Ala Gly Ser Tyr He 
220 225 230 



caa ccc caa ctt atg tga ggttgc ccgcccgtct ccttcttgtc tccccggccc 
Gin Pro Gin Leu Met * 
255 



aggtccctcc 


cgcctccagg 


tccatccatc 


ccgtccggaa 


aagaaggacc 


cagagggaag 


aaggaacagt 


ggaggcggga 


cgccctccat 


ctcctcggag 


ccccgcgagg 


tccggcccag 


caacttcccg 


gcatccgcgc 


tetagectga 


accctggcct 


gggecgagea 


gtggcagcag 


agagtggcct 


eggagggaag 


ccactgccac 


ctgagacagc 


ccaagcagca 


agataaaccc 


gctccacccg 


acccgccgac 


cttcagcttt 


gtgggactat 


caggaaaaaa 


caaaacaaaa 


acaaaatgta 


gaaaaagcaa 


aagctctttt 


ctgtcctgtc 


agtctcctgt 


ctccttttgc 


tctgtctgtg 


cgctggtaaa 


gtccaggtcc 


tcatccgtcc 


gctgtcctca 


ttctgcggcc 


tcagcaaaaa 


gecacaaggt 


ctgagcggcc 


cgggtcctgc 


cgggctgacc 


atetceggat 


cctgggacac 


tctgcctgac 


catctgtgta 


gctggtgtgg 


gaatctgggg 


gcattggagg 


gagggggttt 


aegaaategt 


egacagegaa gtcgacccgg gaattccggg ccggtacgtg 


caggegtae 













807 



egg gec ctg tct get ggc tec cca ccc gtg ccg ccc ggc tgg aac cct 855 
Arg Ala Leu Ser Ala Gly Ser Pro Pro Val Pro Pro Gly Trp Asn Pro 
205 210 215 



903 



ccc age tac aca teg tgg tac cct tea gcg cac caa gaa get atg cag 951 
Pro Ser Tyr Thr Ser Trp Tyr Pro Ser Ala His Gin Glu Ala Met Gin 
235 240 245 250 



1005 

1065 
1125 
1185 
1245 
1305 
1365 
1425 
1485 
1545 
1605 
1614 
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<222> (154) . . (3240) 
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<400> 10 

ccctctaatc tgatcactat agggaatttg gccctcgagc cgaagaattc ggcacgaggt 60 



ctggctggga gcagaaggca gcctcggtct ctgggcggcg gcggcggccc actctgccct 120 

ggccgcgctg tgtggtgacc gcaggcccga gac atg agg gcg gcc cgt get ctg 174 

Met Arg Ala Ala Arg Ala Leu 
1 5 

ctg ccc ctg ctg ctg cag gcc tgc tgg aca gcc gcg cag gat gag ccg 222 
Leu Pro Leu Leu Leu Gin Ala Cys Trp Thr Ala Ala Gin Asp Glu Pro 
10 15 20 

gag acc ccg agg gcc gtg gcc ttc cag gac tgc ccc gtg gac ctg ttc 270 
Glu Thr Pro Arg Ala Val Ala Phe Gin Asp Cys Pro Val Asp Leu Phe 
25 30 35 

ttt gtg ctg gac acc tct gag age gtg gcc ctg agg ctg aag ccc tac 318 
Phe Val Leu Asp Thr Ser Glu Ser Val Ala Leu Arg Leu Lys Pro Tyr 
40 45 50 55 

ggg gcc etc gtg gac aaa gtc aag tec ttc acc aag cgc ttc ate gac 366 
Gly Ala Leu Val Asp Lys Val Lys Ser Phe Thr Lys Arg Phe lie Asp 
60 65 70 

aac ctg agg gac agg tac tac cgc tgt gac cga aac ctg gtg tgg aac 414 
Asn Leu Arg Asp Arg Tyr Tyr Arg Cys Asp Arg Asn Leu Val Trp Asn 
75 80 85 

gca ggc gcg ctg cac tac agt gac gag gtg gag ate ate caa ggc etc 462 
Ala Gly Ala Leu His Tyr Ser Asp Glu Val Glu He He Gin Gly Leu 
90 95 100 

acg cgc atg cct ggc ggc cgc gac gca etc aaa age age gtg gac gcg 510 
Thr Arg Met Pro Gly Gly Arg Asp Ala Leu Lys Ser Ser Val Asp Ala 
105 110 115 

gtc aag tac ttt ggg aag ggc acc tac acc gac tgc get ate aag aag 5 58 

Val Lys Tyr Phe Gly Lys Gly Thr Tyr Thr Asp Cys Ala He Lys Lys 
120 125 130 135 

ggg ctg gag cag etc etc gtg ggg ggc tec cac ctg aag gag aat aag 606 
Gly Leu Glu Gin Leu Leu Val Gly Gly Ser His Leu Lys Glu Asn Lys 
140 145 150 

tac ctg att gtg gtg acc gac ggg cac ccc ctg gag ggc tac aag gaa 654 
Tyr Leu He Val Val Thr Asp Gly His Pro Leu Glu Gly Tyr Lys Glu 
155 160 165 

ccc tgt ggg ggg ctg gag gat get gtg aac gag gcc aag cac ctg ggc 7 02 

Pro Cys Gly Gly Leu Glu Asp Ala Val Asn Glu Ala Lys His Leu Gly 
170 175 180 

gtc aaa gtc ttc teg gtg gcc ate aca ccc gac cac ctg gag ccg cgt 750 
Val Lys Val Phe Ser Val Ala He Thr Pro Asp His Leu Glu Pro Arg 
185 190 195 



34 



846 



894 



942 



990 



1038 



1086 



ctg age ate ate gee acg gac cac acg tac egg cgc aac ttc acg gcg 798 
Leu Ser He He Ala Thr Asp His Thr Tyr Arg Arg Asn Phe Thr Ala 
200 205 210 215 

get gac tgg ggc cag age cgc gac gca gag gag gec ate age cag ace 
Ala Asp Trp Gly Gin Ser Arg Asp Ala Glu Glu Ala He Ser Gin Thr 
220 225 230 

ate gac ace ate gtg gac atg ate aaa aat aac gtg gag caa gtg tgc 
He Asp Thr He Val Asp Met He Lys Asn Asn Val Glu Gin Val Cys 
235 240 245 

tgc tec ttc gaa tgc cag cct gca aga gga cct ccg ggg etc egg ggc 
Cys Ser Phe Glu Cys Gin Pro Ala Arg Gly Pro Pro Gly Leu Arg Gly 
250 255 260 

gac ccc ggc ttt gag gga gaa cga ggc aag ccg ggg etc cca gga gag 
Asp Pro Gly Phe Glu Gly Glu Arg Gly Lys Pro Gly Leu Pro Gly Glu 
265 270 275 

aag gga gaa gec gga gat cct gga aga ccc ggg gac etc gga cct gtt 
Lys Gly Glu Ala Gly Asp Pro Gly Arg Pro Gly Asp Leu Gly Pro Val 
280 285 290 295 

ggg tac cag gga atg aag gga gaa aaa ggg age cgt ggg gag aag ggc 
Gly Tyr Gin Gly Met Lys Gly Glu Lys Gly Ser Arg Gly Glu Lys Gly 
300 305 310 

tec agg gga ccc aag ggc tac aag gga gag aag ggc aag cgt ggc ate 1134 
Ser Arg Gly Pro Lys Gly Tyr Lys Gly Glu Lys Gly Lys Arg Gly He 
315 320 325 

gac ggg gtg gac ggc gtg aag ggg gag atg ggg tac cca ggc ctg cca 1182 
Asp Gly Val Asp Gly Val Lys Gly Glu Met Gly Tyr Pro Gly Leu Pro 
330 335 340 

ggc tgc aag ggc teg ccc ggg ttt gac ggc att caa gga ccc cct ggc 12 3 0 

Gly Cys Lys Gly Ser Pro Gly Phe Asp Gly He Gin Gly Pro Pro Gly 
345 350 355 

ccc aag gga gac ccc ggc gee ttt gga ctg aaa gga gaa aag ggc gag 1278 
Pro Lys Gly Asp Pro Gly Ala Phe Gly Leu Lys Gly Glu Lys Gly Glu 
360 365 370 375 

cct gga get gac ggg gag gcg ggg aga cca ggg age teg gga cca tct 1326 
Pro Gly Ala Asp Gly Glu Ala Gly Arg Pro Gly Ser Ser Gly Pro Ser 
380 385 390 

gga gac gag ggc cag ccg gga gag cct ggg ccc ccc gga gag aaa gga 1374 
Gly Asp Glu Gly Gin Pro Gly Glu Pro Gly Pro Pro Gly Glu Lys Gly 
395 400 405 

gag gcg ggc gac gag ggg aac cca gga cct gac ggt gee ccc ggg gag 1422 
Glu Ala Gly Asp Glu Gly Asn Pro Gly Pro Asp Gly Ala Pro Gly Glu 
410 415 420 

egg ggt ggc cct gga gag aga gga cca egg ggg acc cca ggc acg egg 1470 



35 



Arg Gly Gly Pro Gly Glu Arg Gly Pro Arg Gly Thr Pro Gly Thr Arg 
425 430 435 

gga cca aga gga gac cct ggt gaa get ggc ccg cag ggt gat cag gga 1518 
Gly Pro Arg Gly Asp Pro Gly Glu Ala Gly Pro Gin Gly Asp Gin Gly 
440 445 450 455 

aga gaa ggc ccc gtt ggt gtc cct gga gac ccg ggc gag get ggc cct 1566 
Arg Glu Gly Pro Val Gly Val Pro Gly Asp Pro Gly Glu Ala Gly Pro 
460 465 470 

ate gga cct aaa ggc tac cga ggc gat gag ggt ccc cca ggg tec gag 1614 
He Gly Pro Lys Gly Tyr Arg Gly Asp Glu Gly Pro Pro Gly Ser Glu 
475 480 485 



ata aac ggc acg aag ggc tac ccc ggc etc aag ggg gac gag gga gaa 
He Asn Gly Thr Lys Gly Tyr Pro Gly Leu Lys Gly Asp Glu Gly Glu 
540 545 550 

gec ggg gac ccc gga gac gat aac aac gac att gca ccc cga gga gtc 
Ala Gly Asp Pro Gly Asp Asp Asn Asn Asp He Ala Pro Arg Gly Val 
555 560 565 

aaa gga gca aag ggg tac egg ggt ccc gag ggc ccc cag gga ccc cca 
Lys Gly Ala Lys Gly Tyr Arg Gly Pro Glu Gly Pro Gin Gly Pro Pro 
570 575 580 



etc ctg ttc gtg ctg gac age tea gag age att ggc ctg cag aac ttc 

Leu Leu Phe Val Leu Asp Ser Ser Glu Ser He Gly Leu Gin Asn Phe 
620 625 630 

gag att gee aag gac ttc gtc gtc aag gtc ate gac egg ctg age egg 

Glu He Ala Lys Asp Phe Val Val Lys Val He Asp Arg Leu Ser Arg 
635 640 645 

gac gag ctg gtc aag ttc gag cca ggg cag teg tac gcg ggt gtg gtg 

Asp Glu Leu Val Lys Phe Glu Pro Gly Gin Ser Tyr Ala Gly Val Val 



1662 



ggt gec aga gga gec cca gga cct gec gga ccc cct gga gac ccg ggg 
Gly Ala Arg Gly Ala Pro Gly Pro Ala Gly Pro Pro Gly Asp Pro Gly 
490 495 500 

ctg atg ggt gaa agg gga gaa gac ggc ccc get gga aat ggc ace gag 1710 
Leu Met Gly Glu Arg Gly Glu Asp Gly Pro Ala Gly Asn Gly Thr Glu 
505 510 515 

ggc ttc ccc ggc ttc ccc ggg tat ccg ggc aac agg ggc get ccc ggg 1758 
Gly Phe Pro Gly Phe Pro Gly Tyr Pro Gly Asn Arg Gly Ala Pro Gly 

520 525 530 535 



1806 



1854 



1902 



gga cac caa gga ccg cct ggg ccg gac gaa tgc gag att ttg gac ate 1950 

Gly His Gin Gly Pro Pro Gly Pro Asp Glu Cys Glu He Leu Asp He 
585 590 595 

ate atg aaa atg tgc tct tgc tgt gaa tgc aag tgc ggc ccc ate gac 1998 

He Met Lys Met Cys Ser Cys Cys Glu Cys Lys Cys Gly Pro He Asp 

600 605 610 615 



2046 



2094 



2142 
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650 655 660 

cag tac age cac age cag atg cag gag cac gtg age ctg cgc age ccc 2190 
Gin Tyr Ser His Ser Gin Met Gin Glu His Val Ser Leu Arg Ser Pro 
665 670 675 

age ate egg aac gtg cag gag etc aag gaa gee ate aag age ctg cag 223 8 
Ser He Arg Asn Val Gin Glu Leu Lys Glu Ala He Lys Ser Leu Gin 
680 685 690 695 

tgg atg gcg ggc ggc ace ttc acg ggg gag gee ctg cag tac acg egg 22 86 
Trp Met Ala Gly Gly Thr Phe Thr Gly Glu Ala Leu Gin Tyr Thr Arg 
700 705 710 

gac cag ctg ctg ccg ccc age ccg aac aac cgc ate gee ctg gtc ate 2334 
Asp Gin Leu Leu Pro Pro Ser Pro Asn Asn Arg He Ala Leu Val He 
715 720 725 



act gac ggg cgc tea gac act cag agg gac ace aca ccg etc aac gtg 
Thr Asp Gly Arg Ser Asp Thr Gin Arg Asp Thr Thr Pro Leu Asn Val 
730 735 740 

etc tgc age ccc ggc ate cag gtg gtc tec gtg ggc ate aaa gac gtg 
Leu Cys Ser Pro Gly He Gin Val Val Ser Val Gly He Lys Asp Val 
745 750 755 

ttt gac ttc ate cca ggc tea gac cag etc aat gtc att tct tgc caa 
Phe Asp Phe He Pro Gly Ser Asp Gin Leu Asn Val He Ser Cys Gin 
760 765 770 775 



2382 



2430 



2478 



ggc ctg gca cca tec cag ggc egg ccc ggc etc teg ctg gtc aag gag 2 526 

Gly Leu Ala Pro Ser Gin Gly Arg Pro Gly Leu Ser Leu Val Lys Glu 
780 785 790 

aac tat gca gag ctg ctg gag gat gee ttc ctg aag aat gtc ace gec 2 574 

Asn Tyr Ala Glu Leu Leu Glu Asp Ala Phe Leu Lys Asn Val Thr Ala 
795 800 805 

cag ate tgc ata gac aag aag tgt cca gat tac ace tgc ccc ate acg 2622 
Gin He Cys He Asp Lys Lys Cys Pro Asp Tyr Thr Cys Pro He Thr 
810 815 820 

ttc tec tec ccg get gac ate acc ate ctg ctg gac ggc tec gee age 
Phe Ser Ser Pro Ala Asp He Thr He Leu Leu Asp Gly Ser Ala Ser 
825 830 835 

gtg ggc age cac aac ttt gac acc acc aag cgc ttc gee aag cgc ctg 
Val Gly Ser His Asn Phe Asp Thr Thr Lys Arg Phe Ala Lys Arg Leu 
840 845 850 855 

gee gag cgc ttc etc aca gcg ggc agg acg gac ccc gec cac gac gtg 2766 
Ala Glu Arg Phe Leu Thr Ala Gly Arg Thr Asp Pro Ala His Asp Val 
860 865 870 

egg gtg gcg gtg gtg cag tac age ggc acg ggc cag cag cgc cca gag 2 814 
Arg Val Ala Val Val Gin Tyr Ser Gly Thr Gly Gin Gin Arg Pro Glu 
875 880 885 



2670 



2718 
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egg gcg teg ctg cag ttc ctg cag aac tac acg gec ctg gec agt gec 2862 
Arg Ala Ser Leu Gin Phe Leu Gin Asn Tyr Thr Ala Leu Ala Ser Ala 
890 895 900 



gtc gat gec atg gac ttt ate aac gac gec acc gac gtc aac gat gec 
Val Asp Ala Met Asp Phe He Asn Asp Ala Thr Asp Val Asn Asp Ala 
905 910 915 



aag aag agg ctg ctg etc ttc tea gat ggc aac teg cag ggc gec acg 

Lys Lys Arg Leu Leu Leu Phe Ser Asp Gly Asn Ser Gin Gly Ala Thr 
940 945 950 

ccc get gee ate gag aag gec gtg cag gaa gec cag egg gca ggc ate 

Pro Ala Ala He Glu Lys Ala Val Gin Glu Ala Gin Arg Ala Gly He 
955 960 965 

gag ate ttc gtg gtg gtc gtg ggc cgc cag gtg aat gag ccc cac ate 

Glu He Phe Val Val Val Val Gly Arg Gin Val Asn Glu Pro His He 
970 975 980 



tgcacgccgg 


caccaaaccc 


tgtcctccca 


cccctcccca 


ctcatcacta 


aacagagaaa 


agcttggaaa 


gccaggacac 


aacgetgetg 


ectgetttgt 


gcagggtcct 


ccggggctca 


gecctgagtt 


ggcatcacct 


gcgcagggcc 


ctctggggct 


cagccctgag 


ctagtgtcac 


ctgcacaggg 


ccctctgggg 


ctcagccctg 


agctggcgtc 


acctgtgcag 


ggccctctgg 


ggctcagccc 


tgagctggcc 


tcacctgggt 


tccccacccc 


gggctctcct 


gccctgccct 


cctgcccgcc 


ctccctcctg 


cctgcgcagc 


tccttcccta 


ggcacctctg 


tgctgcgtcc 


caccagcctg 


ageaagaege 


cctctcgggg 


cctgtgccgc 


actagcctcc 


ctctcctctg 


tccccatagc 


tggtttttcc 


caccaatcct 


cacctaacag 


ttactttaca 


attaaactca 


aagcaagctc 


ttctcctcag 


cttggggcag 


ccattggcct 


ctgtctcgtt 


ttgggaaacc 


aaggtcagga 


ggccgttgca 


gacataaatc 


tcggcgactc 


ggccccgtct 


cctgagggtc 



2910 



ctg ggc tat gtg acc cgc ttc tac cgc gag gec teg tec ggc get gec 2 95 8 
Leu Gly Tyr Val Thr Arg Phe Tyr Arg Glu Ala Ser Ser Gly Ala Ala 
920 925 930 935 



3006 



3054 



3102 



cgc gtc ctg gtc acc ggc aag acg gec gag tac gac gtg gec tac ggc 315 0 

Arg Val Leu Val Thr Gly Lys Thr Ala Glu Tyr Asp Val Ala Tyr Gly 
985 990 995 

gag age cac ctg ttc cgt gtc ccc age tac cag gec ctg etc cgc ggt 3198 
Glu Ser His Leu Phe Arg Val Pro Ser Tyr Gin Ala Leu Leu Arg Gly 
1000 1005 1010 1015 

gtc ttc cac cag aca gtc tec agg aag gtg gcg ctg ggc tag cccaccc 3247 
Val Phe His Gin Thr Val Ser Arg Lys Val Ala Leu Gly * 
1020 1025 

3307 
3367 
3427 
3487 
3547 
3607 
3667 
3727 
3787 
3847 
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ctgctggtga ccggcctgga ccttggccct acagccctgg aggccgctgc tgaccagcac 3907 

tgaccccgac ctcagagagt actcgcaggg gcgctggctg cactcaagac cctcgagatt 3 967 

aacggtgcta accccgtctg ctcctccctc ccgcagagac tggggcctgg actggacatg 402 7 

agagcccctt ggtgccacag agggctgtgt cttactagaa acaacgcaaa cctctccttc 4087 

ctcagaatag tgatgtgttc gacgttttat caaaggcccc ctttctatgt tcatgttagt 4147 

tttgctcctt ctgtgttttt ttctgaacca tatccatgtt gctgactttt ccaaataaag 4207 

gttttcactc ctcaaaaaaa aaaaaaaaag ggcggccgct ctagagtatc cctcgagggg 4267 

cccaagctta cgcgtaccca gctttcttgt acaaagtggt ccctatagtg agtcgtatat 4327 

aagctaggga cgtttcagcg acag 4351 



<210> 11 

<211> 1622 

<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 

<222> (864) . . (1424) 



<400> 11 



taagcttgcg 


gccgctgaac 


tacagctctg 


cgcctgccca 


ggcggccgca 


cgctcagggg 


60 


cgtggcatgg gtgggtcgtg 


agttgggcgg 


ggcccacagg 


gcgtgcgcga 


cgcagcggcg 


120 


cggcgcgagg 


cgtaaggggc 


gtggcgccag 


tgggcgtggc 


gtggcgcagt 


gcgaagggac 


180 


gcggtgcgca 


tgcgcgtgag 


ggcttccacg ggtgggtggt 


atcgaggcct 


gtcgggtcag 


240 


ggcggttcgc 


gggtgctgtc 


agagctgggc 


cggggcccct 


aggcaggccc 


agacatgtcc 


300 


gtccttgtaa gttaaaagct 


tccatgggag 


ccttccttcc 


taatcaagat 


gcaaatagta 


360 


cggtattccg 


aacagacact 


aaaaatagct 


gtcatctcaa 


agaatccagt 


gcttgtgtca 


420 


cagtatgaga 


aagtagatgc 


tggggaacag 


cgtttaatga 


atgaggcatg 


caagccagcc 


480 


agtgatctct 


ttggaccttg 


cattctccat 


cagattggat 


cacctcccac 


cctgaggccc 


540 


cccaagactt 


tgaacagttc 


ttcagtcatc 


cttacagaaa 


gataccctct 


ccagacaaac 


600 


gcagtattta 


tatacggtcc 


attggatctc 


tatgaagcac 


cagaattatc 


agtgaagaat 


660 


atattaaatg 


gctcacgggc 


tactgtaaag 


catatttcta 


tcgcttgaga 


gtaaaactcc 


720 


tagaaccagt 


tcctgtttct 


gtaacaagat 


gttcctttag 


agtcaatgag 


aacacacaca 


780 



39 



1034 



1082 



acctacaaat tcatgcaggg gacatcctga agttcttgaa aaagaagaaa cctgaagatg 84 0 

ccttctgtgt tgtgggaata aca atg att gat ctt tac cca aga gac teg 890 

Met lie Asp Leu Tyr Pro Arg Asp Ser 
1 5 

tgg aat ttt gtc ttt gga cag gec tct ttg aca gat ggt gtg ggg ata 93 8 

Trp Asn Phe Val Phe Gly Gin Ala Ser Leu Thr Asp Gly Val Gly He 
10 15 20 25 

ttc age ttt gec agg tat ggc agt gat ttt tat age atg cac tat aaa 986 
Phe Ser Phe Ala Arg Tyr Gly Ser Asp Phe Tyr Ser Met His Tyr Lys 
30 35 40 

ggc aaa gtg aag aag etc aag aaa aca tct tea agt gac tat tea att 
Gly Lys Val Lys Lys Leu Lys Lys Thr Ser Ser Ser Asp Tyr Ser He 
45 50 55 

ttc gac aac tat tat att cca gaa ata act agt gtt tta eta ctt cga 
Phe Asp Asn Tyr Tyr He Pro Glu He Thr Ser Val Leu Leu Leu Arg 
60 65 70 

tec tgt aag act tta acc cat gag ate gga cac ata ttt gga ctg cga 113 0 

Ser Cys Lys Thr Leu Thr His Glu He Gly His He Phe Gly Leu Arg 
75 80 85 

cac tgc cag tgg ctt gca tgc etc atg caa ggc tec aac cac ttg gaa 1178 
His Cys Gin Trp Leu Ala Cys Leu Met Gin Gly Ser Asn His Leu Glu 
90 95 100 105 

gaa get gac egg cgc cct eta aac ctt tgc cct ate tgt ttg cac aag 1226 
Glu Ala Asp Arg Arg Pro Leu Asn Leu Cys Pro He Cys Leu His Lys 
110 115 120 

ttg cag tgt get gtt ggc ttc age att gta gaa aga tac aaa gca ctg 
Leu Gin Cys Ala Val Gly Phe Ser He Val Glu Arg Tyr Lys Ala Leu 
125 130 135 

gtg agg tgg att gat gat gaa tct tct gac aca cct gga gca act cca 1322 
Val Arg Trp He Asp Asp Glu Ser Ser Asp Thr Pro Gly Ala Thr Pro 
140 145 150 

gaa cac agt cac gag gat aat ggg aat tta ccg aaa ccc gtg gaa gee 1370 
Glu His Ser His Glu Asp Asn Gly Asn Leu Pro Lys Pro Val Glu Ala 
155 160 165 

ttt aag gaa tgg aaa gag tgg ata ata aaa tgc ctg get gtt etc caa 1418 
Phe Lys Glu Trp Lys Glu Trp He He Lys Cys Leu Ala Val Leu Gin 
170 175 180 185 

aaa tga ggaccttcaa ataggagtga ttgaaataaa taactacttg catgttatgc 14 74 
Lys * 

tttcatttgg gtggaatact tcattggaat aaactactga tcttgtgctg tgtcaaagta 1534 

acagactaga accttctttc aagtacctga attgaaatga aactcatttt gaataataaa 1594 



1274 



40 



aactctagaa actcttaaaa aaaaaaaa 



1622 



<210> 12 

<211> 882 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> CDS 

<222> (147) . . (284) 

<400> 12 

gagccgagat cacgccactg cactccagcc tgggcaacaa gagcaaaact ctgtctcaat 60 

aaaaaaaaag gaaaacccac agccaacatt atcactagtg gtaacaggga aaatgtcccc 12 0 

tttgaccccc accccaagta cagaac atg caa gga cgc ctg ttc tea ctg cat 173 

Met Gin Gly Arg Leu Phe Ser Leu His 
1 5 

gtc ttc tgt gtt gta ctg aag etc eta get ggt gca gtg agg caa gac 221 
Val Phe Cys Val Val Leu Lys Leu Leu Ala Gly Ala Val Arg Gin Asp 
10 15 20 25 



aaa gaa agg aaa tat agg tgc act ggg aag gaa gaa gaa aca ctg cct 
Lys Glu Arg Lys Tyr Arg Cys Thr Gly Lys Glu Glu Glu Thr Leu Pro 
30 35 40 



269 



tta ttc ttt agg tga catgattgtg tgcttttaaa ataataaagg aatcaacaga 324 
Leu Phe Phe Arg * 
45 



aaagttgctt 


aacctaatga 


atgagtttat 


caaagtcaca 


agatacaagg 


tcagtataca 


384 


aaaatcagtt 


ggatttctac 


atggtagaaa 


caactgtaca 


tggaaaaatg 


tttaatagtg 


444 


taagatatgt 


acattggaaa 


ctatgaaaga 


gtgtaaaaaa 


taaagaagtg 


aaataaatgg 


504 


agaagatacc 


accttgatgg 


atggaagcct 


taaggtaaag 


atgetcatte 


tccccacact 


564 


gacctgtaca 


ttccccacag gectaatcaa 


aacccccaca 


ggcttctgtg 


tagaaattga 


624 


catgetgate 


ctgaaattta 


tatgaaaatg 


caaagagtct 


ggaataacca 


aaataatttt 


684 


gtaaaagaac 


aaagaagact 


tctactacct 


ggttataaga 


cttctctgaa 


gcacagaagt 


744 


caaggcagtg 


tggtggtggc 


ataagtaatg 


taaatcatcc 


aggtgtggtg 


gctcaagcca 


804 


gtcatccagc 


actttgggag 


gctgaggcag 


gagaatcget 


agageccagg 


agttggaaac 


864 


cagcctgggc 


agcatagc 










882 
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<210> 13 

<211> 442 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> CDS 

<222> (58) . . (336) 



<400> 13 

gctccggaat tcccgggaac atttacactg acagcaagta cgccttagct actgtgc 57 
atg tac ata aag cca tct acc agg aaa tea ggc tac tea cct cag cag 
Met Tyr He Lys Pro Ser Thr Arg Lys Ser Gly Tyr Ser Pro Gin Gin 
15 10 15 



gec cat agt aac cag aaa get gat tea gca get cag gtc act gee aga 
Ala His Ser Asn Gin Lys Ala Asp Ser Ala Ala Gin Val Thr Ala Arg 
35 40 45 

ctt tea gtc acg cct cca aac ttg ctg ccc aca gtc tec ttt cca cag 
Leu Ser Val Thr Pro Pro Asn Leu Leu Pro Thr Val Ser Phe Pro Gin 
50 55 60 

cca gat ctg cct gac aat ccc gta tac tea aca aca aca gaa aaa ctg 
Pro Asp Leu Pro Asp Asn Pro Val Tyr Ser Thr Thr Thr Glu Lys Leu 
65 70 75 80 

get tea gat etc aga gee aat aaa aat cag gaa agt tag tagattcttc 
Ala Ser Asp Leu Arg Ala Asn Lys Asn Gin Glu Ser * 
85 90 

ctgactctgg aatcttcata ccctgaactt aaaccagtta cctacagtct accacccatt 



105 



gta get gtg ate cac tgc aaa gga cat caa aaa gaa aac acg gee gtt 153 
Val Ala Val He His Cys Lys Gly His Gin Lys Glu Asn Thr Ala Val 
20 25 30 



201 



249 



297 



346 



406 



taagaagagc aaagttacct cagctcctcc ggaggg 442 



<210> 14 

<211> 2058 

<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 

<222> (173) . . (1909) 
<400> 14 

taagcttgcg gccgcccggc ggtgtagatg gtacgcttgc ttgtacgcag caggggaggc 
tetgacttge actggctggt ggtccgcttc eggggagage gcccagtgcc cacaggtggc 



42 



aggagcctgt ccagggactc tgcattgctg ctgcagagag ggatgtactc gg atg 175 

Met 
1 

tgg ccc aga get gec cac etc cca ggt acc cgc tgg cat ggc egg ccc 223 
Trp Pro Arg Ala Ala His Leu Pro Gly Thr Arg Trp His Gly Arg Pro 
5 10 15 

aca ggg cat ccc ccc cca ccc cca atg age ctg ccc gee tea gee atg 271 
Thr Gly His Pro Pro Pro Pro Pro Met Ser Leu Pro Ala Ser Ala Met 
20 25 30 

cct gtt gag ggg gtg ggg ggg gat gec ctg tgg gec ggc cat gec age 319 
Pro Val Glu Gly Val Gly Gly Asp Ala Leu Trp Ala Gly His Ala Ser 
35 40 45 

ggg tac ctg gga ggt ggc cag etc tgg gec aca tec gag tac ate cct 367 
Gly Tyr Leu Gly Gly Gly Gin Leu Trp Ala Thr Ser Glu Tyr He Pro 
50 55 60 65 

etc tgc age age aat gca gag tec ctg gac agg etc ctg cca cct gtg 415 
Leu Cys Ser Ser Asn Ala Glu Ser Leu Asp Arg Leu Leu Pro Pro Val 
70 75 80 

ggc act ggg cgc tct ccc egg aag egg acc acc age cag tgc aag tea 463 
Gly Thr Gly Arg Ser Pro Arg Lys Arg Thr Thr Ser Gin Cys Lys Ser 
85 90 95 

gag cct ccc ctg ctg cgt aca age aag cgt acc ate tac acc gee ggg 511 
Glu Pro Pro Leu Leu Arg Thr Ser Lys Arg Thr He Tyr Thr Ala Gly 
100 105 110 

egg ccg ccc tgg tac aat gaa cac ggc acg caa tec aaa gag gee ttc 559 
Arg Pro Pro Trp Tyr Asn Glu His Gly Thr Gin Ser Lys Glu Ala Phe 
115 120 125 

gec ate ggc ttg gga ggc ggc agt gec tct ggg aag acc act gtg gec 607 
Ala He Gly Leu Gly Gly Gly Ser Ala Ser Gly Lys Thr Thr Val Ala 
130 135 140 145 

aga atg ate ate gag gee ctg gat gtg ccc tgg gtg gtc ttg ctg tec 655 
Arg Met He He Glu Ala Leu Asp Val Pro Trp Val Val Leu Leu Ser 
150 155 160 

atg gac tec ttc tac aag gtg ctg act gag cag cag cag gaa cag gee 703 
Met Asp Ser Phe Tyr Lys Val Leu Thr Glu Gin Gin Gin Glu Gin Ala 
165 170 175 

gca cac aac aac ttc aac ttc gac cac cca gat gee ttt gac ttc gac 751 
Ala His Asn Asn Phe Asn Phe Asp His Pro Asp Ala Phe Asp Phe Asp 
180 185 190 

etc ate att tec acc etc aag aag ctg aag cag ggg aag agt gtc aag 799 
Leu He He Ser Thr Leu Lys Lys Leu Lys Gin Gly Lys Ser Val Lys 
195 200 205 
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gtg ccc att tat gac ttc acc acg cac age egg aag aag gac tgg aaa 847 
Val Pro He Tyr Asp Phe Thr Thr His Ser Arg Lys Lys Asp Trp Lys 
210 215 220 225 

aca ctg tat ggt gca aac gtc ate ate ttt gag ggc ate atg gec ttt 895 
Thr Leu Tyr Gly Ala Asn Val He He Phe Glu Gly He Met Ala Phe 
230 235 240 

get gac aag aca ctg ttg gag etc ctg gac atg aag ate ttt gtg gac 943 
Ala Asp Lys Thr Leu Leu Glu Leu Leu Asp Met Lys He Phe Val Asp 
245 250 255 

aca gac tec gac ate cgc ctg gta egg egg ctg cgc egg gac ate agt 991 
Thr Asp Ser Asp He Arg Leu Val Arg Arg Leu Arg Arg Asp He Ser 
260 265 270 

gag cgc ggc egg gac ate gag ggt gtc ate aag cag tac aac aag ttt 103 9 

Glu Arg Gly Arg Asp He Glu Gly Val He Lys Gin Tyr Asn Lys Phe 
275 280 285 

gtc aag ccc tec ttc gac cag tac ate cag ccc acc atg cgc ctg gca 1087 
Val Lys Pro Ser Phe Asp Gin Tyr He Gin Pro Thr Met Arg Leu Ala 
290 295 300 305 

gac ate gtg gtc ccc aga ggg age ggc aac acg gtg gec ate gac ctg 1135 
Asp He Val Val Pro Arg Gly Ser Gly Asn Thr Val Ala He Asp Leu 
310 315 320 

att gtg cag cac gtg cac age cag ctg gag gag cgt gaa etc age gtc 1183 
He Val Gin His Val His Ser Gin Leu Glu Glu Arg Glu Leu Ser Val 
325 330 335 

agg get gcg ctg gee teg gca cac cag tgc cac ccg ctg ccc egg acg 1231 
Arg Ala Ala Leu Ala Ser Ala His Gin Cys His Pro Leu Pro Arg Thr 
340 345 350 

ctg age gtc ctg aag age acg ccg cag gta egg ggc atg cac acc ate 12 79 

Leu Ser Val Leu Lys Ser Thr Pro Gin Val Arg Gly Met His Thr He 
355 360 365 

ate agg gac aag gag acc agt cgc gac gag ttc ate ttc tac tec aag 1327 
He Arg Asp Lys Glu Thr Ser Arg Asp Glu Phe He Phe Tyr Ser Lys 
370 375 380 385 

aga ctg atg egg ctg etc ate gag cac gcg etc tec ttc ctg ccc ttt 1375 
Arg Leu Met Arg Leu Leu He Glu His Ala Leu Ser Phe Leu Pro Phe 
390 395 400 

cag gac tgc gtc gta cag acc ccg cag ggg cag gac tat gcg ggc aag 1423 
Gin Asp Cys Val Val Gin Thr Pro Gin Gly Gin Asp Tyr Ala Gly Lys 
405 410 415 

tgc tat gcg ggg aag cag ate acc ggt gtg tec att ctg cgc gee ggt 1471 
Cys Tyr Ala Gly Lys Gin He Thr Gly Val Ser He Leu Arg Ala Gly 
420 425 430 

gaa acc atg gag ccc gcg ctg cgc get gtg tgc aaa gac gtg cgc ate 1519 



44 



Glu Thr Met Glu Pro Ala Leu Arg Ala Val Cys Lys Asp Val Arg lie 
435 440 445 



ggc acc ate etc ate cag ace aac cag ctt acc ggg gag ccc gag etc 1567 
Gly Thr lie Leu lie Gin Thr Asn Gin Leu Thr Gly Glu Pro Glu Leu 
450 455 460 465 

cac tac ctg agg ctg ccc aag gac ate age gat gac cac gtg ate etc 1615 
His Tyr Leu Arg Leu Pro Lys Asp lie Ser Asp Asp His Val lie Leu 
470 475 480 

atg gac tgc acc gtg tec acg ggc gcg gcg gec atg atg gca gtg cgc 1663 
Met Asp Cys Thr Val Ser Thr Gly Ala Ala Ala Met Met Ala Val Arg 
485 490 495 

gtg etc ctg gac cac gac gtg cct gag gac aag ate ttt ttg ctg teg 1711 
Val Leu Leu Asp His Asp Val Pro Glu Asp Lys lie Phe Leu Leu Ser 
500 505 510 

ctg etc atg gca gag atg ggc gtg cac tea gtg gec tat gca ttt ccg 1759 
Leu Leu Met Ala Glu Met Gly Val His Ser Val Ala Tyr Ala Phe Pro 
515 520 525 

cga gtg aga ate ate acc acg gcg gtg gac aag egg gtc aat gac ctt 1807 
Arg Val Arg lie lie Thr Thr Ala Val Asp Lys Arg Val Asn Asp Leu 
530 535 540 545 

ttc cgc ate ate cca ggc att ggg aac ttt ggc gac cgc tac ttt ggg 1855 
Phe Arg lie lie Pro Gly lie Gly Asn Phe Gly Asp Arg Tyr Phe Gly 
550 555 560 

aca gac gcg gtc ccc gat ggc agt gac gag gag gaa gtg gec tac acg 1903 
Thr Asp Ala Val Pro Asp Gly Ser Asp Glu Glu Glu Val Ala Tyr Thr 
565 570 575 

ggt tag ctgcccagtg agccatcccg tccccaccac cctcctcctg cctcctgacc 1959 
Gly * 



caggactget gaatacaaag atgttaattt ttaaaatgtt actagtataa tttattctat 2019 
gcattttata aaataaataa agctttagaa aaaaaaaaa 2 058 



<210> 15 

<211> 1705 

<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 

<222> (545) . . (1672) 
<400> 15 

aaggatcctt aattttatta atcccccccc ccccgaaggc ccccagccgc ctgtgccccg 60 
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cggtgcgcgc cccgctccgg gacctgccgc caccgccgcc ccgccctcgg cggcacccac 12 0 

acccaggcgc gcccgcgcgc gcgcccggcc ccgtccctgc ctggaagcac agctgaagat 180 

ggcgagcccg gcgcctccgg agcacgccga ggagggatgc ccggctcctg ccgccgagga 24 0 

gcaggcgccg ccgtcgccgc caccgcccca ggcatccccc gcagagcggc agcagcagga 3 00 

ggaggaagcg caggaagctg gggcggcgga gggcgcgggg ttgcaggtgg aggaggccgc 360 

gggccgggcg gcggccgcgg taacctggct gctcggggag ccggtgctgt ggctgggctg 42 0 

ccgcgccgac gagctgctga gctggaagag gccgctgcgg agcctgctcg gcttcgtcgc 480 

tgccaacctg ctgttctggt tccttgcatt gactccatgg agagtatatc acctgatttc 540 

cgtc atg ata ctt ggg cgt gtt att atg caa ata ata aag gat atg gtt 589 
Met lie Leu Gly Arg Val lie Met Gin lie lie Lys Asp Met Val 
15 10 15 

ttg tct aga aca aga ggt gca cag ttg tgg aga age etc agt gaa age 63 7 
Leu Ser Arg Thr Arg Gly Ala Gin Leu Trp Arg Ser Leu Ser Glu Ser 
20 25 30 

tgg gaa gtt ate aat tec aaa cca gat gaa aga ccc agg etc age cac 685 
Trp Glu Val lie Asn Ser Lys Pro Asp Glu Arg Pro Arg Leu Ser His 
35 40 45 

tgt att gca gaa tea tgg atg aat ttc age ata ttt ctt caa gaa atg 733 
Cys lie Ala Glu Ser Trp Met Asn Phe Ser lie Phe Leu Gin Glu Met 
50 55 60 



tct ctt ttt aaa cag cag age cct ggc aag ttt tgt etc ctg gtc tgt 
Ser Leu Phe Lys Gin Gin Ser Pro Gly Lys Phe Cys Leu Leu Val Cys 
65 70 75 

agt gtg tgc aca ttt ttt acg ate ttg gga agt tac att cct ggg gtt 
Ser Val Cys Thr Phe Phe Thr He Leu Gly Ser Tyr He Pro Gly Val 
80 85 90 95 

ata etc age tat eta ctg tta ctg tgt gca ttt ttg tgt cca ttg ttt 
He Leu Ser Tyr Leu Leu Leu Leu Cys Ala Phe Leu Cys Pro Leu Phe 
100 105 110 

aaa tgt aat gat att gga caa aaa att tac age aaa att aag tea gtt 
Lys Cys Asn Asp He Gly Gin Lys He Tyr Ser Lys He Lys Ser Val 
115 120 125 



781 



829 



877 



925 



ctg ctg aaa ctg gat ttt gga att gga gaa tat att aat cag aag aaa 973 

Leu Leu Lys Leu Asp Phe Gly He Gly Glu Tyr He Asn Gin Lys Lys 
130 135 140 

cgt gag aga tct gaa gca gat aaa gaa aaa agt cac aaa gat gac agt 1021 

Arg Glu Arg Ser Glu Ala Asp Lys Glu Lys Ser His Lys Asp Asp Ser 
145 150 155 
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gaa tta gac ttt tea get ctt tgt cct aag att age etc acg gtt get 1069 
Glu Leu Asp Phe Ser Ala Leu Cys Pro Lys lie Ser Leu Thr Val Ala 
160 165 170 175 

gee aaa gag tta tct gtg tct gac aca gac gtc tea gag gta tec tgg 1117 
Ala Lys Glu Leu Ser Val Ser Asp Thr Asp Val Ser Glu Val Ser Trp 
180 185 190 

act gat aat ggg acc ttc aac ctt tea gaa gga tac act cca cag aca 1165 
Thr Asp Asn Gly Thr Phe Asn Leu Ser Glu Gly Tyr Thr Pro Gin Thr 
195 200 205 

gac act tct gat gat ctt gac cga ccc agt gag gaa gtt ttc tct aga 1213 
Asp Thr Ser Asp Asp Leu Asp Arg Pro Ser Glu Glu Val Phe Ser Arg 
210 215 220 

gat ctt tea gat ttt cca tct eta gaa aat ggc atg gga aca aat gat 1261 
Asp Leu Ser Asp Phe Pro Ser Leu Glu Asn Gly Met Gly Thr Asn Asp 
225 230 235 

gaa gat gaa tta age ctt ggt ttg ccc act gag etc aag aga aag aag 1309 
Glu Asp Glu Leu Ser Leu Gly Leu Pro Thr Glu Leu Lys Arg Lys Lys 
240 245 250 255 

gaa cag ttg gac agt ggt cac aga cca age aaa gag acg caa tea gca 1357 
Glu Gin Leu Asp Ser Gly His Arg Pro Ser Lys Glu Thr Gin Ser Ala 
260 265 270 

get ggt etc acc ctt cct ctg aac agt gac caa acc ttt cac ctg atg 1405 
Ala Gly Leu Thr Leu Pro Leu Asn Ser Asp Gin Thr Phe His Leu Met 
275 280 285 

age aac ctg get ggg gat gtt ate aca get gca gtg act gca get ate 1453 
Ser Asn Leu Ala Gly Asp Val lie Thr Ala Ala Val Thr Ala Ala lie 
290 295 300 

aaa gac cag tta gag ggt gtg cag caa gca ctt tct cag get gee ccc 15 01 

Lys Asp Gin Leu Glu Gly Val Gin Gin Ala Leu Ser Gin Ala Ala Pro 
305 310 315 

ate cca gaa gag gac aca gac act gaa gaa ggt gat gac ttt gaa eta 1549 
He Pro Glu Glu Asp Thr Asp Thr Glu Glu Gly Asp Asp Phe Glu Leu 
320 325 330 335 

ctt gac cag tea gag ctg gat caa att gag agt gaa ttg gga ctt aca 15 97 

Leu Asp Gin Ser Glu Leu Asp Gin He Glu Ser Glu Leu Gly Leu Thr 
340 345 350 

caa gac cag gaa gca gaa gca cag caa aat aag aag tct tea ggt ttc 1645 
Gin Asp Gin Glu Ala Glu Ala Gin Gin Asn Lys Lys Ser Ser Gly Phe 
355 360 365 

ctt tea aat ctg ctg gga ggc cat taa tctag gaatcagctt gcaacagagc 1697 
Leu Ser Asn Leu Leu Gly Gly His * 
370 375 

acaaaaaa 1705 
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<210> 16 

<211> 1914 

<212> DMA 

<213> Homo sapiens 

<220> 
<221> CDS 

<222> (154) . . (1593) 



<400> 16 

gggagggtgt atggatgata acgctagtac catccggcct atttaggtga cactatagaa 

caagtttgta caaaaaagca ggctggtacc ggtccggaat tcccgggata tcgtcgaccc 

acgcgtccgg tgcaccgcgt tctcgcacgc gtc atg gcg gtc etc gga gta cag 

Met Ala Val Leu Gly Val Gin 
1 5 



60 
120 
174 



ctg gtg gtg acc ctg etc act gec acc etc atg cac agg ctg gcg cca 
Leu Val Val Thr Leu Leu Thr Ala Thr Leu Met His Arg Leu Ala Pro 
10 15 20 



222 



cac tgc tec ttc gcg cgc tgg ctg etc tgt aac ggc agt ttg ttc cga 
His Cys Ser Phe Ala Arg Trp Leu Leu Cys Asn Gly Ser Leu Phe Arg 
25 30 35 



270 



tac aag cac ccg tct gag gag gag ctt egg gee ctg gcg ggg aag ccg 
Tyr Lys His Pro Ser Glu Glu Glu Leu Arg Ala Leu Ala Gly Lys Pro 
40 45 50 55 



318 



agg ccc aga ggc agg aaa gag egg tgg gec aat ggc ctt agt gag gag 
Arg Pro Arg Gly Arg Lys Glu Arg Trp Ala Asn Gly Leu Ser Glu Glu 
60 65 70 

aag cca ctg tct gtg ccc cga gat gec ccg ttc cag ctg gag acc tgc 
Lys Pro Leu Ser Val Pro Arg Asp Ala Pro Phe Gin Leu Glu Thr Cys 
75 80 85 



366 



414 



ccc etc acg acc gtg gat gec ctg gtc ctg cgc ttc ttc ctg gag tac 
Pro Leu Thr Thr Val Asp Ala Leu Val Leu Arg Phe Phe Leu Glu Tyr 
90 95 100 



462 



cag tgg ttt gtg gac ttt get gtg tac teg ggc ggc gtg tac etc ttc 
Gin Trp Phe Val Asp Phe Ala Val Tyr Ser Gly Gly Val Tyr Leu Phe 
105 110 115 



510 



aca gag gec tac tac tac atg ctg gga cca gee aag gag act aac att 
Thr Glu Ala Tyr Tyr Tyr Met Leu Gly Pro Ala Lys Glu Thr Asn lie 
120 125 130 135 



558 



get gtg ttc tgg tgc ctg etc acg gtg acc ttc tec ate aag atg ttc 
Ala Val Phe Trp Cys Leu Leu Thr Val Thr Phe Ser lie Lys Met Phe 
140 145 150 



606 
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ctg aca gtg aca egg ctg tac ttc age gec gag gag ggg ggt gag cgc 654 
Leu Thr Val Thr Arg Leu Tyr Phe Ser Ala Glu Glu Gly Gly Glu Arg 
155 160 165 

tct gtc tgc etc acc ttt gee ttc etc ttc ctg ctg ctg gee atg ctg 702 
Ser Val Cys Leu Thr Phe Ala Phe Leu Phe Leu Leu Leu Ala Met Leu 
170 175 180 

gtg caa gtg gtg egg gag gag acc etc gag ctg ggc ctg gag cct ggt 75 0 

Val Gin Val Val Arg Glu Glu Thr Leu Glu Leu Gly Leu Glu Pro Gly 
185 190 195 

ctg gee age atg acc cag aac tta gag cca ctt ctg aag aag cag ggc 798 
Leu Ala Ser Met Thr Gin Asn Leu Glu Pro Leu Leu Lys Lys Gin Gly 
200 205 210 215 

tgg gac tgg gcg ctt cct gtg gee aag ctg get ate cgc gtg gga ctg 84 6 

Trp Asp Trp Ala Leu Pro Val Ala Lys Leu Ala lie Arg Val Gly Leu 
220 225 230 

gca gtg gtg ggc tct gtg ctg ggt gee ttc etc acc ttc cca ggc ctg 894 
Ala Val Val Gly Ser Val Leu Gly Ala Phe Leu Thr Phe Pro Gly Leu 
235 240 245 

egg ctg gec cag acc cac egg gac gca ctg acc atg teg gag gac aga 942 
Arg Leu Ala Gin Thr His Arg Asp Ala Leu Thr Met Ser Glu Asp Arg 
250 255 260 

ccc atg ctg cag ttc etc ctg cac acc age ttc ctg tct ccc ctg ttc 990 
Pro Met Leu Gin Phe Leu Leu His Thr Ser Phe Leu Ser Pro Leu Phe 
265 270 275 

ate ctg tgg etc tgg aca aag ccc att gca egg gac ttc ctg cac cag 1038 
lie Leu Trp Leu Trp Thr Lys Pro lie Ala Arg Asp Phe Leu His Gin 
280 285 290 295 

ccg ccg ttt ggg gag acg cgt ttc tec ctg ctg tec gat tct gec ttc 1086 
Pro Pro Phe Gly Glu Thr Arg Phe Ser Leu Leu Ser Asp Ser Ala Phe 
300 305 310 

gac tct ggg cgc etc tgg ttg ctg gtg gtg ctg tgc ctg ctg egg ctg 1134 
Asp Ser Gly Arg Leu Trp Leu Leu Val Val Leu Cys Leu Leu Arg Leu 
315 320 325 

gcg gtg acc egg ccc cac ctg cag gee tac ctg tgc ctg gee aag gee 1182 
Ala Val Thr Arg Pro His Leu Gin Ala Tyr Leu Cys Leu Ala Lys Ala 
330 335 340 

egg gtg gag cag ctg cga agg gag get ggc cgc ate gaa gec cgt gaa 12 30 

Arg Val Glu Gin Leu Arg Arg Glu Ala Gly Arg He Glu Ala Arg Glu 

345 350 355 

ate cag cag agg gtg gtc cga gtc tac tgc tat gtg acc gtg gtg age 12 7 8 

He Gin Gin Arg Val Val Arg Val Tyr Cys Tyr Val Thr Val Val Ser 
360 365 370 375 
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ttg cag tac ctg acg ccg etc ate etc acc etc aac tgc aca ctt ctg 
Leu Gin Tyr Leu Thr Pro Leu lie Leu Thr Leu Asn Cys Thr Leu Leu 
380 385 390 



1326 



etc aag acg ctg gga ggc tat tec tgg ggc ctg ggc cca get cct eta 1374 
Leu Lys Thr Leu Gly Gly Tyr Ser Trp Gly Leu Gly Pro Ala Pro Leu 
395 400 405 

eta tec ccc gac cca tec tea gec age get gec ccc ate ggc tct ggg 1422 
Leu Ser Pro Asp Pro Ser Ser Ala Ser Ala Ala Pro lie Gly Ser Gly 
410 415 420 

gag gac gaa gtc cag cag act gca gcg egg att gec ggg get ctg ggt 1470 
Glu Asp Glu Val Gin Gin Thr Ala Ala Arg lie Ala Gly Ala Leu Gly 
425 430 435 

ggc ctg ctt act ccc etc ttc etc cgt ggc gtc ctg gec tac etc ate 1518 
Gly Leu Leu Thr Pro Leu Phe Leu Arg Gly Val Leu Ala Tyr Leu lie 
440 445 450 455 

tgg tgg acg get gee tgc cag ctg etc gee age ctt ttc ggc etc tac 1566 
Trp Trp Thr Ala Ala Cys Gin Leu Leu Ala Ser Leu Phe Gly Leu Tyr 
460 465 470 

ttc cac cag cac ttg gca ggc tec tag ctgee tgcagaccct cctggggccc 1618 
Phe His Gin His Leu Ala Gly Ser * 
475 480 

tgaggtctgt tcctggggca gegggacact agcctgcccc ctctgtttgc gcccccgtgt 1678 

ccccagctgc aaggtggggc cggactcccc ggcgttccct tcaccacagt gcctgacccg 173 8 

cggcccccct tggaegcega gtttctgect cagaactgtc tctcctgggc ccagcagcat 1798 

gagggtcccg aggecattgt ctccgaagcg tatgtgccag gtttgagtgg cgagggtgat 1858 

getggctget cttctgaaca aataaaggag catgecgatt tttacaaaaa aaaaaa 1914 



<210> 17 

<211> 859 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> CDS 

<222> (61) . . (795) 



<400> 17 

atttggcect egaggecaag aatteggcac gaggaaggag gtcccccacg gcccttcagg 60 

atg aaa get gcg gtg ctg acc ttg gee gtg etc ttc ctg acg ggg age 108 
Met Lys Ala Ala Val Leu Thr Leu Ala Val Leu Phe Leu Thr Gly Ser 
15 10 15 



50 



cag get egg cat ttc tgg cag caa gat gaa ccc ccc cag age ccc tgg 156 
Gin Ala Arg His Phe Trp Gin Gin Asp Glu Pro Pro Gin Ser Pro Trp 
20 25 30 

gat cga gtg aag gac ctg gee act gtg tac gtg gat gtg etc aaa gac 2 04 

Asp Arg Val Lys Asp Leu Ala Thr Val Tyr Val Asp Val Leu Lys Asp 
35 40 45 

age ggc aaa gac age gtg acc tec acc ttc age aag ctg cgc gaa cag 2 52 

Ser Gly Lys Asp Ser Val Thr Ser Thr Phe Ser Lys Leu Arg Glu Gin 
50 55 60 

etc ggc cct gtg acc cag gag ttc tgg gat aac ctg gaa aag gag aca 3 00 

Leu Gly Pro Val Thr Gin Glu Phe Trp Asp Asn Leu Glu Lys Glu Thr 
65 70 75 80 

gag ggc ctg agg cag gag atg age aag gat ctg gag gag gtg aag gee 348 
Glu Gly Leu Arg Gin Glu Met Ser Lys Asp Leu Glu Glu Val Lys Ala 
85 90 95 

aag gtg cag ccc tac ctg gac gac ttc cag aag aag tgg cag gag gag 3 96 

Lys Val Gin Pro Tyr Leu Asp Asp Phe Gin Lys Lys Trp Gin Glu Glu 
100 105 110 

atg gag etc tac cgc cag aag gtg gag ccg ctg cgc gca gag etc caa 444 
Met Glu Leu Tyr Arg Gin Lys Val Glu Pro Leu Arg Ala Glu Leu Gin 
115 120 125 

gag ggc gcg cgc cag aag ctg cac gag ctg caa gag aag ctg age cca 4 92 

Glu Gly Ala Arg Gin Lys Leu His Glu Leu Gin Glu Lys Leu Ser Pro 
130 135 140 

ctg ggc gag gag atg cgc gac cgc gcg cgc gee cat gtg gac gcg ctg 540 
Leu Gly Glu Glu Met Arg Asp Arg Ala Arg Ala His Val Asp Ala Leu 
145 150 155 160 

cgc acg cat ctg gee ccc tac age gac gag ctg cgc cag cgc ttg gee 588 
Arg Thr His Leu Ala Pro Tyr Ser Asp Glu Leu Arg Gin Arg Leu Ala 
165 170 175 

gcg cgc ctt gag get etc aag gag aac ggc ggc gee aga ctg gec gag 636 
Ala Arg Leu Glu Ala Leu Lys Glu Asn Gly Gly Ala Arg Leu Ala Glu 
180 185 190 

tac cac gee aag gec acc gag cat ctg age acg etc age gag aag gee 684 
Tyr His Ala Lys Ala Thr Glu His Leu Ser Thr Leu Ser Glu Lys Ala 
195 200 205 

aag ccc gcg etc gag gac etc cgc caa ggc ctg ctg ccc gtg ctg gag 732 
Lys Pro Ala Leu Glu Asp Leu Arg Gin Gly Leu Leu Pro Val Leu Glu 
210 215 220 

age ttc aag gtc age ttc ctg age get etc gag gag tac act aag aag 780 
Ser Phe Lys Val Ser Phe Leu Ser Ala Leu Glu Glu Tyr Thr Lys Lys 
225 230 235 240 

etc aac acc cag tga ggcgcccgcc gccgcccccc ttcccggtgc tcagaataaa 835 



51 



Leu Asn Thr Gin * 
245 

cgtttccaaa gtgggaaaaa aaaa 859 



<210> 18 

<211> 2454 

<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 

<222> (154) . . (1710) 
<400> 18 

gtgaaaggag ggaacgcagg tgagaaagcg agacaggcag gtagggaaat cgtgaggtga 60 

gcgtgatcct agctccttgt ggcagagcct agagagaagg cgaggacgct gaagaaccag 12 0 

gcggacagct ggcagagaga gaagttggct age atg gaa tea cca gag gag cct 174 

Met Glu Ser Pro Glu Glu Pro 
1 5 

gga gca tec atg gat gag aac tac ttt gtg aac tac act ttc aaa gat 222 
Gly Ala Ser Met Asp Glu Asn Tyr Phe Val Asn Tyr Thr Phe Lys Asp 
10 15 20 

egg tea cat tea ggc cgt gtg get caa ggc ate atg aaa ctg tgt eta 270 
Arg Ser His Ser Gly Arg Val Ala Gin Gly lie Met Lys Leu Cys Leu 
25 30 35 

gag gag gag etc ttt get gat gtc acc att teg gtg gaa ggc egg gag 318 
Glu Glu Glu Leu Phe Ala Asp Val Thr lie Ser Val Glu Gly Arg Glu 
40 45 50 55 

ttt cag etc cat egg ctg gtc etc tea get cag age tgc ttc ttc cga 366 
Phe Gin Leu His Arg Leu Val Leu Ser Ala Gin Ser Cys Phe Phe Arg 
60 65 70 

tec atg ttc act tec aac ctg aag gag gee cac aac egg gtg att gtg 414 
Ser Met Phe Thr Ser Asn Leu Lys Glu Ala His Asn Arg Val lie Val 
75 80 85 

ctg cag gat gtc age gag tct gtt ttc cag etc ctg gtt gat tat ate 462 
Leu Gin Asp Val Ser Glu Ser Val Phe Gin Leu Leu Val Asp Tyr lie 
90 95 100 

tac cat ggg act gtg aaa ctt cga get gag gag ttg cag gaa att tat 510 
Tyr His Gly Thr Val Lys Leu Arg Ala Glu Glu Leu Gin Glu lie Tyr 
105 110 115 

gag gtg tea gac atg tat cag ctg aca tct etc ttt gag gaa tgc tct 558 
Glu Val Ser Asp Met Tyr Gin Leu Thr Ser Leu Phe Glu Glu Cys Ser 
120 125 130 135 



52 



egg ttt ttg gec cgc aca gtg caa gtg gga aac tgc ctt cag gtg atg 606 
Arg Phe Leu Ala Arg Thr Val Gin Val Gly Asn Cys Leu Gin Val Met 
140 145 150 

tgg ctg gca gat egg cac agt gat cct gag etc tat acg get gee aag 654 
Trp Leu Ala Asp Arg His Ser Asp Pro Glu Leu Tyr Thr Ala Ala Lys 
155 160 165 

cac tgt gec aag ace cac ctg gec cag ctg cag aat aca gag gaa ttt 702 
His Cys Ala Lys Thr His Leu Ala Gin Leu Gin Asn Thr Glu Glu Phe 
170 175 180 

etc cac ttg ccc cac cgc tta etc aca gat ate ate teg gat gga gtt 750 
Leu His Leu Pro His Arg Leu Leu Thr Asp lie lie Ser Asp Gly Val 
185 190 195 

ccg tgt tct cag aac cca aca gag gca ata gaa gec tgg ate aac ttt 798 
Pro Cys Ser Gin Asn Pro Thr Glu Ala lie Glu Ala Trp lie Asn Phe 
200 205 210 215 

aat aaa gag gaa aga gag get ttt gca gag tea etc agg aca age ttg 84 6 

Asn Lys Glu Glu Arg Glu Ala Phe Ala Glu Ser Leu Arg Thr Ser Leu 
220 225 230 

aag gaa att ggg gag aat gtg cac att tac ctg att ggg aaa gag tea 894 
Lys Glu lie Gly Glu Asn Val His lie Tyr Leu lie Gly Lys Glu Ser 
235 240 245 

tct cgt acc cac teg ttg get gtg tec ttg cac tgt gca gaa gat gac 942 
Ser Arg Thr His Ser Leu Ala Val Ser Leu His Cys Ala Glu Asp Asp 
250 255 260 

tec ate agt gta agt ggc caa aac agt ttg tgc cac cag ate act gcg 990 
Ser lie Ser Val Ser Gly Gin Asn Ser Leu Cys His Gin lie Thr Ala 
265 270 275 

gec tgc aag cat ggt gga gac ttg tat gtg gtg gga ggg tec ate cca 103 8 
Ala Cys Lys His Gly Gly Asp Leu Tyr Val Val Gly Gly Ser lie Pro 
280 285 290 295 

egg cgc atg tgg aag tgc aac aat gec acc gtt gac tgg gag tgg tgt 1086 
Arg Arg Met Trp Lys Cys Asn Asn Ala Thr Val Asp Trp Glu Trp Cys 
300 305 310 

get cct ttg cct egg gac egg etc cag cac acc ctg gtg tct gtg ccc 1134 
Ala Pro Leu Pro Arg Asp Arg Leu Gin His Thr Leu Val Ser Val Pro 
315 320 325 

ggg aaa gat gec ata tat tea ctg ggt ggc aag aca ctg caa gat acc 1182 
Gly Lys Asp Ala lie Tyr Ser Leu Gly Gly Lys Thr Leu Gin Asp Thr 
330 335 340 

etc tec aac gca gtc att tat tat cgc gta ggt gat aat gtg tgg aca 1230 
Leu Ser Asn Ala Val lie Tyr Tyr Arg Val Gly Asp Asn Val Trp Thr 
345 350 355 



53 



gag aca act cag eta gag gtg get gtg tea ggg get get ggt gee aac 1278 
Glu Thr Thr Gin Leu Glu Val Ala Val Ser Gly Ala Ala Gly Ala Asn 
360 365 370 375 

etc aac ggg ate ate tac tta eta ggg ggg gag gag aat gat ctg gac 1326 
Leu Asn Gly lie lie Tyr Leu Leu Gly Gly Glu Glu Asn Asp Leu Asp 
380 385 390 

ttc ttt ace aaa cct tec cga etc ate cag tgc ttt gac aca gag aca 1374 
Phe Phe Thr Lys Pro Ser Arg Leu lie Gin Cys Phe Asp Thr Glu Thr 
395 400 405 

gac aaa tgc cat gtg aag ccc tat gtg ctg ccc ttt gca ggc cgc atg 1422 
Asp Lys Cys His Val Lys Pro Tyr Val Leu Pro Phe Ala Gly Arg Met 
410 415 420 

cac gca get gtg cat aaa gat ctg gtg ttc ate gtg get gaa ggg gac 1470 
His Ala Ala Val His Lys Asp Leu Val Phe lie Val Ala Glu Gly Asp 
425 430 435 

tec ctg gtg tgc tac aat ccc ttg eta gac age ttc ace egg ctt tgc 1518 
Ser Leu Val Cys Tyr Asn Pro Leu Leu Asp Ser Phe Thr Arg Leu Cys 
440 445 450 455 

ctt cct gag gee tgg age tct gee cca tec etc tgg aag att gee age 1566 
Leu Pro Glu Ala Trp Ser Ser Ala Pro Ser Leu Trp Lys lie Ala Ser 
460 465 470 

tgt aac ggg age ate tat gtc ttc egg gac cga tat aaa aag ggg gat 1614 
Cys Asn Gly Ser lie Tyr Val Phe Arg Asp Arg Tyr Lys Lys Gly Asp 
475 480 485 

gee aac ace tac aag ctt gac cct gee act tea gee gta act gtc aca 1662 
Ala Asn Thr Tyr Lys Leu Asp Pro Ala Thr Ser Ala Val Thr Val Thr 
490 495 500 

aga ggt att aag gtg ctg ctt ace aat ttg cag ttt gtg ttg gee taa 1710 
Arg Gly lie Lys Val Leu Leu Thr Asn Leu Gin Phe Val Leu Ala * 
505 510 515 

ggctgtgggg aggggaggag aactgctcac tccttttccc tccccataca aactcaaagt 1770 

cccctgggcc ccaattcaga gttatgtttt ttttggcaca tactagaaag gcagtgcctc 183 0 

agcccttccc tgaatccatg gaggtgttct gtttggggct ttttagactg ctgctgctca 1890 

gctggttgct tgaactgaca gtaggecage ctgttctctg ccattcccta gtcatcctgt 1950 

gcctcaccac agettgetta gagcaagect tttctcagac cttaggcaca gcctctcctc 2010 

tttacctgat caatgttaaa tgtaagcacc cctgatccca ggacataagg aaagatgece 2 070 

aattgtactt ttgttctata gcctgtgaaa tggctagttg atcatttttc cacaaagaat 2130 

taggtgttaa gagttttcct tcaggcttta cttaggagaa tggactaagc tgaaggtgta 2190 

cttcaccagc aagagtcaac tctagaattc aggatgttcc ttctattgtt ttcttageca 2250 
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tctgtcagga aatgtaactt tggttttatt tttggcttat tccaaggggt aagccagaaa 2310 
atagaaatga ttatttctga ttaatagcag aaactttttc aatctcaaat atataaggtg 2370 
tctgctcttt taaaagctct aagctaagtc aagagctagg aactgttgat acaaataaaa 2430 



<210> 19 

<211> 3586 

<212> DMA 

<213> Homo sapiens 

<220> 
<221> CDS 

<222> (106) . . (1860) 
<400> 19 

gtaccggtcc ggaattcccg ggtcgacgat ttcgtggagc aggcagacgt cgaagccgtc 60 
ctgcagccga ccctcctgca tctgggcacg ttcaggttgc gacac atg aag get 114 



gtttttgaag 



ggaaaaaaaa aaaa 



2454 



Met Lys Ala 
1 



ttg atg aac gag aag gcc cag gcc gcc ctg gtg gag ttt gtg gag gac 
Leu Met Asn Glu Lys Ala Gin Ala Ala Leu Val Glu Phe Val Glu Asp 
5 10 15 



162 



gtc aat cac get gcc att ccc agg gag ate cca cgc aag gat ggg gtc 
Val Asn His Ala Ala lie Pro Arg Glu lie Pro Arg Lys Asp Gly Val 
20 25 30 35 



210 



tgg agg gtt ctg tgg aag gac cgt gcg cag gac acg agg gtc ctg agg 
Trp Arg Val Leu Trp Lys Asp Arg Ala Gin Asp Thr Arg Val Leu Arg 
40 45 50 
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cag atg aca cgc ctg ctg ctg gat gac ggg ccc acg cag gcc gcg gag 
Gin Met Thr Arg Leu Leu Leu Asp Asp Gly Pro Thr Gin Ala Ala Glu 
55 60 65 
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get ggg acc ccc ggt gag gca ccc acc cct ccc get teg gag acg cag 
Ala Gly Thr Pro Gly Glu Ala Pro Thr Pro Pro Ala Ser Glu Thr Gin 
70 75 80 
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gcc cag gat tct ggt gag gta aca ggg cat get ggc teg ctt ctt ggg 
Ala Gin Asp Ser Gly Glu Val Thr Gly His Ala Gly Ser Leu Leu Gly 
85 90 95 



402 



gca ccc agg aac cca agg agg ggc cgt egg ggt cgc aga aac aga acc 
Ala Pro Arg Asn Pro Arg Arg Gly Arg Arg Gly Arg Arg Asn Arg Thr 
100 105 110 115 



450 



aga cgc aac agg ttg acc cag aag ggc aag aag aga age cga gga gga 



498 



55 



Arg Arg Asn Arg Leu Thr Gin Lys Gly Lys Lys Arg Ser Arg Gly Gly 
120 125 130 



egg ccg tct get ccc gcg agg agt gag gec gag gac tct tec gac gag 546 
Arg Pro Ser Ala Pro Ala Arg Ser Glu Ala Glu Asp Ser Ser Asp Glu 
135 140 145 

age ctg ggc ate gtg ate gag gag ate gac cag ggc gac ctg age gga 594 
Ser Leu Gly He Val He Glu Glu He Asp Gin Gly Asp Leu Ser Gly 
150 155 160 

gaa gag gac cag age gcg ctg tac gec acg ctg cag gec get gec agg 642 
Glu Glu Asp Gin Ser Ala Leu Tyr Ala Thr Leu Gin Ala Ala Ala Arg 
165 170 175 

gag ctg gtt agg cag tgg gcg ccc tgc aac tec gag ggg gaa gaa gac 690 
Glu Leu Val Arg Gin Trp Ala Pro Cys Asn Ser Glu Gly Glu Glu Asp 
180 185 190 195 

ggt ccc cgc gag ttc ttg get ctg gtc acc gtc ace gac aaa teg aag 738 
Gly Pro Arg Glu Phe Leu Ala Leu Val Thr Val Thr Asp Lys Ser Lys 
200 205 210 

aaa gaa gag gca gag aag gag cca get ggg gee gaa tec ate cgc ttg 786 
Lys Glu Glu Ala Glu Lys Glu Pro Ala Gly Ala Glu Ser He Arg Leu 
215 220 225 

aac acc aaa gaa gac aaa aat ggt gtc ccc gac tta gtg gec ctg ctg 834 
Asn Thr Lys Glu Asp Lys Asn Gly Val Pro Asp Leu Val Ala Leu Leu 
230 235 240 

get gtg aga gac acc ccg gac gag gag ccg gtg gac age gac act teg 882 
Ala Val Arg Asp Thr Pro Asp Glu Glu Pro Val Asp Ser Asp Thr Ser 
245 250 255 

gag age gac teg cag gaa agt ggg gac caa gaa aca gag gag ttg gat 93 0 

Glu Ser Asp Ser Gin Glu Ser Gly Asp Gin Glu Thr Glu Glu Leu Asp 
260 265 270 275 

aat cct gag ttc gtg gee att gtg gee tat acc gac ccg teg gac ccc 978 
Asn Pro Glu Phe Val Ala He Val Ala Tyr Thr Asp Pro Ser Asp Pro 
280 285 290 

tgg gec egg gag gag atg ttg aaa ate get tct gtt ate gag teg ctg 1026 
Trp Ala Arg Glu Glu Met Leu Lys He Ala Ser Val He Glu Ser Leu 
295 300 305 

ggc tgg age gac gag aaa gac aag cga gac ccc etc cga cag gtc ttg 1074 
Gly Trp Ser Asp Glu Lys Asp Lys Arg Asp Pro Leu Arg Gin Val Leu 
310 315 320 

tec gtc atg tec aag gac act aac ggg acc cgc gtg aag gtg gaa gag 1122 
Ser Val Met Ser Lys Asp Thr Asn Gly Thr Arg Val Lys Val Glu Glu 
325 330 335 

gcg ggc cgc gag gtg gac gee gtg gtc ctg cgc aag gee ggg gat gac 1170 
Ala Gly Arg Glu Val Asp Ala Val Val Leu Arg Lys Ala Gly Asp Asp 
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340 



345 



350 



355 



ggg gac etc egg gag tgc att tec acc ttg gcg cag ccg gat etc cct 1218 
Gly Asp Leu Arg Glu Cys He Ser Thr Leu Ala Gin Pro Asp Leu Pro 
360 365 370 

ccc cag gcg aag aag get ggg cgt ggc etc ttc ggg ggc tgg age gag 12 66 

Pro Gin Ala Lys Lys Ala Gly Arg Gly Leu Phe Gly Gly Trp Ser Glu 
375 380 385 

cac cgt gag gac gaa ggg ggt ctt ctg gag ctg gtg gcg etc ctg get 1314 
His Arg Glu Asp Glu Gly Gly Leu Leu Glu Leu Val Ala Leu Leu Ala 
390 395 400 

gec cag gac atg gcg gag gtg atg aag gag gaa aaa gaa aac gee tgg 13 62 

Ala Gin Asp Met Ala Glu Val Met Lys Glu Glu Lys Glu Asn Ala Trp 
405 410 415 

gaa ggc ggg aag tac aaa tac ccc aaa ggc aaa ctg ggg gag gta ttg 1410 
Glu Gly Gly Lys Tyr Lys Tyr Pro Lys Gly Lys Leu Gly Glu Val Leu 
420 425 430 435 

gcg etc ctg gec gec egg gag aac atg ggg tec aac gag ggg teg gag 1458 
Ala Leu Leu Ala Ala Arg Glu Asn Met Gly Ser Asn Glu Gly Ser Glu 
440 445 450 

gag get teg gac gaa cag tec gag gag gag teg gag gac acc gag age 1506 
Glu Ala Ser Asp Glu Gin Ser Glu Glu Glu Ser Glu Asp Thr Glu Ser 
455 460 465 

gag gcg teg gag ccg gag gac agg gca tec agg aag ccc egg gec aag 1554 
Glu Ala Ser Glu Pro Glu Asp Arg Ala Ser Arg Lys Pro Arg Ala Lys 
470 475 480 

gcg cgc acg gec ccc agg ggc ctg act ccg gee ggc gcg cct ccc 1602 
Arg Ala Arg Thr Ala Pro Arg Gly Leu Thr Pro Ala Gly Ala Pro Pro 
485 490 495 

acc get tec ggg gee cgc aaa acc cgc gcg ggc ggc cga ggc cga ggc 1650 
Thr Ala Ser Gly Ala Arg Lys Thr Arg Ala Gly Gly Arg Gly Arg Gly 
500 505 510 515 

cga ggc egg ggc gtc act ccc gag aag aaa gee ggg age egg ggc teg 1698 
Arg Gly Arg Gly Val Thr Pro Glu Lys Lys Ala Gly Ser Arg Gly Ser 
520 525 530 

gec cag gac gac gee gca gga age agg aag aag aag ggg age gcg ggc 1746 
Ala Gin Asp Asp Ala Ala Gly Ser Arg Lys Lys Lys Gly Ser Ala Gly 
535 540 545 

tec ggg gee cat gee agg gca ggc gag gee aag ggc cag gcg ccc act 1794 
Ser Gly Ala His Ala Arg Ala Gly Glu Ala Lys Gly Gin Ala Pro Thr 
550 555 560 

gga tec aag gec gcg cgc ggg aag aag gee cgt egg ggc egg agg ctg 1842 
Gly Ser Lys Ala Ala Arg Gly Lys Lys Ala Arg Arg Gly Arg Arg Leu 
565 570 575 
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ccc cct aaa tgc cgc tag tggccc cccaagaagc cgcccaggct gcgagcaggc 1896 
Pro Pro Lys Cys Arg * 
580 585 

cccgcagggc acccgcccgc ctgtggcccc cgccctcccc tcccctcttc ctgtcctccg 1956 

cagacgcaat ctcctcgctt cacagcgcgc ccgggccgcg ttttgccagc gtcacgttcc 2016 

cctctcgggc cctcgcaggc cgggggcgcc agcgatcccg acggaggaag cccggatggg 2 076 

aggaggaaag agaagtgggc gcccgaggca gcagcgcagg gccgagatgg ggacgcgcca 213 6 

agtggaccag gattgggggc ccgggttgcc cccggagggg gtgtgtgtgt ggacgccggg 2196 

cacctgcaga ggcgagcagg gctcttcgtg gcgctctcgg ggcctgcgcc tggcaggtgc 2256 

tgtaggccgc tgtcgcccct accccagtct gactgggccc tgggtctgtg gtggaggctc 2 316 

agtcaccagc cgcgcagcgc gtgtcagggc gcaactctca gccaggggag gccccagctc 2 3 76 

ccagccaggg gaagagatga ttccagaaag gaaagtctga gagatagaag gcggttggga 2436 

aggggaggag gaggaaaggg gagaggaacg gtgggagaag ggaaagagga ggaggagggg 24 96 

gaggggggag cagagggaag acacatgcca gccctgccta ctggggcgcc cctgataaca 2556 

aaggaaccag ccccaggcca aggccacctg cccctgacca caagttgaat ttgtcactca 2 616 

gactgcagtg tttcccaaca ttctaattat ttgcagaggt gttcaatttg gggtaattca 2676 

cttaaaatcc agttttggtt cttctgggct gagtgggccc tggcccctcc cataggctgt 2736 

ggctcccctg ggtgccccct ctccagtgga gctgacccac cgctcagcgc tggccttgca 2796 

gcccttacta aaagacttga aagtccctgg gttcaccccc tgagtgaatt aaaggccaga 2856 

ggggccccga agggcactgt gagggacaga ggctcacctg ggcagtgcag aagccggccg 2 916 

cgtgtccctc cttacagggg atgaaatgac ctggggagga aaccccagcc ctgccctgga 2 976 

ggttccagag taggcgggcc ggtgctgtga ggcttcacaa cctgctgtcc caagcacgct 3 03 6 

tgagttgtat gtgagtctgt gccgtgccgt gccgtatgct tcagctcctg caaccccggc 3 096 

tgagctcgat ttttacctaa atatcagtct ccacgggacc ccaccttcat tcatgccttc 3156 

ttgtccctgg ggcaatgtgt gtgcttcctc gtcccaattt ccattccctg gcagtgagga 3216 

gcccatcgtg ccagggggcc ctgccccact tgtccctggg aaggaatagg agggtttggg 3276 

tgtgacctca cagtccagac cagactgtcc cagtcctatg tcagggacac ccagatgtag 3336 

aagctgactg agacctgctg cagggcgtgg gtgctcccct ctgcttggag gctgtccctg 3396 

gacagtgacc cacccactga ggaccaggct gggtgtacct tgagctgggc acagcagcct 3456 
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gtggtgttgc ctgtgggtgg ggagggcccc aggtgtgctt ctcccgtagc agtcctaggc 
ttctctccct gtgccctgtg tcacctggat cctccagtaa agtgaaattc agcactgtaa 
aaaaaaaaaa 



3516 
3576 
3586 



<210> 20 

<211> 3340 

<212> DMA 

<213> Homo sapiens 

<220> 

<221> CDS 

<222> (47) . . (1894) 

<400> 20 

gatacagttc tgaccatcat tacgccaatc ttggcacgag gggaag atg gcg gag 55 

Met Ala Glu 
1 

tec ggc ggt age age ggt ggt get ggt ggc ggc ggc get ttc ggc gcg 103 
Ser Gly Gly Ser Ser Gly Gly Ala Gly Gly Gly Gly Ala Phe Gly Ala 
5 10 15 

ggc ccg ggc ccc gag cgc ccg aac age acg gec gac aag aac ggg gec 151 
Gly Pro Gly Pro Glu Arg Pro Asn Ser Thr Ala Asp Lys Asn Gly Ala 
20 25 30 35 

etc aag tgc acc ttc teg gca ccc age cac age acc age etc ctg cag 199 
Leu Lys Cys Thr Phe Ser Ala Pro Ser His Ser Thr Ser Leu Leu Gin 
40 45 50 

ggc ctg gec acc etc cgc get cag ggc cag etc etc gat gtt gtg ctg 247 
Gly Leu Ala Thr Leu Arg Ala Gin Gly Gin Leu Leu Asp Val Val Leu 
55 60 65 

act att aac aga gag gec ttt cct gca cac aag gtc gtc ctg get gec 295 
Thr lie Asn Arg Glu Ala Phe Pro Ala His Lys Val Val Leu Ala Ala 
70 75 80 

tgc age gac tac ttc agg gec atg ttc acc ggc ggc atg egg gag gca 34 3 

Cys Ser Asp Tyr Phe Arg Ala Met Phe Thr Gly Gly Met Arg Glu Ala 
85 90 95 

age cag gac gtc ate gag ctg aag ggc gtg teg gee cgt ggc ctg egg 3 91 

Ser Gin Asp Val lie Glu Leu Lys Gly Val Ser Ala Arg Gly Leu Arg 
100 105 110 115 

cac ate ate gac ttc gec tac age gee gag gtg aca ctg gac ctg gac 439 
His lie lie Asp Phe Ala Tyr Ser Ala Glu Val Thr Leu Asp Leu Asp 
120 125 130 

tgc gtg cag gac gtg ctg ggc gcg gec gtg ttc ttg cag atg ctg ccc 487 
Cys Val Gin Asp Val Leu Gly Ala Ala Val Phe Leu Gin Met Leu Pro 
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135 



140 



145 



gtg gtg gag ctg tgc gag gag ttc ctg aag gcg gcc atg age gtg gag 535 
Val Val Glu Leu Cys Glu Glu Phe Leu Lys Ala Ala Met Ser Val Glu 
150 155 160 

acc tgc etc aac ate ggc cag atg gcc acc acc ttc age ctg gcc teg 583 
Thr Cys Leu Asn lie Gly Gin Met Ala Thr Thr Phe Ser Leu Ala Ser 
165 170 175 

ctg cga gag teg gtg gat gcc ttc acc ttc egg cac ttc ctg cag ate 631 
Leu Arg Glu Ser Val Asp Ala Phe Thr Phe Arg His Phe Leu Gin lie 
180 185 190 195 

gcc gag gag gag gat ttc ctg cgc ctg cca ctg gag cgc ctg gtc ttc 679 
Ala Glu Glu Glu Asp Phe Leu Arg Leu Pro Leu Glu Arg Leu Val Phe 
200 205 210 

ttc ctg cag age aac egg ctg cag age tgt gcc gag ate gac ctg ttc 727 
Phe Leu Gin Ser Asn Arg Leu Gin Ser Cys Ala Glu lie Asp Leu Phe 
215 220 225 

cgc gcg gcc gtc cgc tgg ctg cag cat gac ccg gcc egg egg ccg cgc 775 
Arg Ala Ala Val Arg Trp Leu Gin His Asp Pro Ala Arg Arg Pro Arg 
230 235 240 

gcc age cac gtg etc tgc cac att cgc ttc ccg etc atg cag teg tec 823 
Ala Ser His Val Leu Cys His lie Arg Phe Pro Leu Met Gin Ser Ser 
245 250 255 

gag ctg gtg gac age gtg cag acg ctg gac ate atg gtg gag gac gtg 871 
Glu Leu Val Asp Ser Val Gin Thr Leu Asp lie Met Val Glu Asp Val 
260 265 270 275 

ctg tgc cgc cag tat ctg ctg gag gcc ttc aac tac cag gtg ctg ccc 919 
Leu Cys Arg Gin Tyr Leu Leu Glu Ala Phe Asn Tyr Gin Val Leu Pro 
280 285 290 

ttc egg cag cac gag atg cag tct ccg cgc acc gcc gtg cgc teg gat 967 
Phe Arg Gin His Glu Met Gin Ser Pro Arg Thr Ala Val Arg Ser Asp 
295 300 305 

gtg ccc teg etc gtc acc ttc ggc ggc acg ccc tac acc gac age gac 1015 
Val Pro Ser Leu Val Thr Phe Gly Gly Thr Pro Tyr Thr Asp Ser Asp 
310 315 320 

cgc teg gtc age age aag gtc tac cag ctg cct gag ccg gga gcc cgc 1063 
Arg Ser Val Ser Ser Lys Val Tyr Gin Leu Pro Glu Pro Gly Ala Arg 
325 330 335 

cac ttc cgc gag etc acg gag atg gag gta ggc tgc age cac acg tgc 1111 
His Phe Arg Glu Leu Thr Glu Met Glu Val Gly Cys Ser His Thr Cys 
340 345 350 355 

gtg gcc gtg ctg gac aat ttt gtg tac gtg gcc ggg ggg cag cac ctg 1159 
Val Ala Val Leu Asp Asn Phe Val Tyr Val Ala Gly Gly Gin His Leu 
360 365 370 
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cag tac cgc age ggc gag ggc gca gtg gac gec tgc tac cgc tac gac 12 07 

Gin Tyr Arg Ser Gly Glu Gly Ala Val Asp Ala Cys Tyr Arg Tyr Asp 
375 380 385 

ccc cac ctg aat cgc tgg ctg cgc ctg cag gec atg cag gaa age cgc 1255 
Pro His Leu Asn Arg Trp Leu Arg Leu Gin Ala Met Gin Glu Ser Arg 
390 395 400 

ate cag ttc cag ctg aac gtg ctg tgc ggc atg gtg tac gec acg ggc 13 03 

lie Gin Phe Gin Leu Asn Val Leu Cys Gly Met Val Tyr Ala Thr Gly 
405 410 415 

ggc cgc aac cga gec ggc age ctg gec tec gtg gag egg tac tgc ccc 13 51 

Gly Arg Asn Arg Ala Gly Ser Leu Ala Ser Val Glu Arg Tyr Cys Pro 
420 425 430 435 

c 99 C 9 C aat gag tgg ggc tac gec tgc teg ctg aag cgc cgt acc tgg 13 99 

Arg Arg Asn Glu Trp Gly Tyr Ala Cys Ser Leu Lys Arg Arg Thr Trp 
440 445 450 

ggc cat get ggg gee gee tea ggg ggc cgc etc tac ate teg ggt ggc 1447 
Gly His Ala Gly Ala Ala Ser Gly Gly Arg Leu Tyr He Ser Gly Gly 
455 460 465 

tac ggg ate tea gtg gag gac aag aag gee ctg cac tgc tac gac ccc 14 95 

Tyr Gly He Ser Val Glu Asp Lys Lys Ala Leu His Cys Tyr Asp Pro 
470 475 480 

gtg gee gac cag tgg gag ttc aag gcg ccc atg age gaa ccc cgc gtg 1543 
Val Ala Asp Gin Trp Glu Phe Lys Ala Pro Met Ser Glu Pro Arg Val 
485 490 495 

eta cac gee atg gtg ggt gee ggc ggc cgc ate tat gec etc ggg ggc 15 91 

Leu His Ala Met Val Gly Ala Gly Gly Arg He Tyr Ala Leu Gly Gly 
500 505 510 515 

cgc atg gac cac gcg gac cgc tgc ttc gac gtg ctg get gtg gag tac 1639 
Arg Met Asp His Ala Asp Arg Cys Phe Asp Val Leu Ala Val Glu Tyr 
520 525 530 

tat gtg ccg gag acg gac cag tgg acc age gtg age ccc atg egg gec 16 87 

Tyr Val Pro Glu Thr Asp Gin Trp Thr Ser Val Ser Pro Met Arg Ala 
535 540 545 

ggc cag tea gag gee ggc tgc tgc ctg ctg gag agg aag ate tac ate 1735 
Gly Gin Ser Glu Ala Gly Cys Cys Leu Leu Glu Arg Lys He Tyr He 
550 555 560 

gtc ggg ggc tac aac tgg cgt etc aac aac gtc acg ggc ate gta cag 1783 
Val Gly Gly Tyr Asn Trp Arg Leu Asn Asn Val Thr Gly He Val Gin 
565 570 575 

gtg tac aac acg gac acc gac gag tgg gag egg gac ctg cac ttc ccg 1831 
Val Tyr Asn Thr Asp Thr Asp Glu Trp Glu Arg Asp Leu His Phe Pro 
580 585 590 595 
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gag tec ttc gca ggc ata gec tgc gec ccc gtc ctg ctg ccc egg gec 1879 
Glu Ser Phe Ala Gly He Ala Cys Ala Pro Val Leu Leu Pro Arg Ala 
600 605 610 

ggg acc agg agg tag cccccaagac ccccgggacc ctggcctgac cgcatgttgt 1934 
Gly Thr Arg Arg * 
615 

ctccaagtgg ggcttggcga atgeaegtet gectgagaac cccagtgccc cccttcgccc 1994 

gggctgccct tgaggggect gctgcgttga taagcccccc tcccaggggt ccctccctcc 2 054 

ctccttccca aagcagatcc tggctgegag tccatccgag ggagcctgcc ggcaaagcgt 2114 

ctgacatgtg gtggcagcaa attcgtcccc ggggtggttt cctcgcctgg cccccgagtc 2174 

cccacgggct ggcgggtgga atcccaggtc tccagggggt ccctgtgcag ctccatctca 2234 

cttctctact gcctcccagc cccacggttt caggcattca gatgtgagct catcaacatt 2294 

gaacccaaag tcggtggtat atgactcacc ctccttccaa gtctcctgcg cgcgtgtttt 2354 

taaaataaac tcacccgaaa cgtccgtaac acacggacct ccaggagcag tgagaggtgg 2 414 

ettagaagee cctggtttgg ggtgggtgga ggaggagggc acgtgtctgc ctcccctggg 2474 

gtgccctcct tcccccatcc caagctgetg aggggaggee ctggtcatgc ctcagttcct 2534 

gccttcatct getttcegga ggaaaaacca tatcaactcc tagaaacget ccttaggggc 2594 

ttgggacctt ccatttggca ctgagcatct tgtggggcct taactggctg agacatcccg 2 654 

gccctctaca tttgccctgt tggecaggea gtccccttcc cgcaattgga gggegacget 2714 

aacttcagaa tcccatagtg gcgcttgccg caggtctggt ggggtgtctt ttttctcctc 2774 

cctcctttcc acccctccgc gccctgccac tccctggcct gccctgtttt tgggtcaaca 2834 

ttgctacgga gecagcagtg aggcctttcc cttcaagggc tctgtggtat ctctggccac 2894 

atttgttcta atgtctgaac ctctaaatct tttctttttg attggtttta ctgtttttaa 2954 

gaagecagea ctgctgtctc atagatggga tttgtactct tggggcaact taaagtgtct 3014 

ctctcgctgc taacagacga ttgatgtctt gtctctgtga cccactcacc atgtaaagaa 3074 

ttaacctcct atcttagcag acatcgtctc ctaatatttc cctttattta ataaaaatgt 3134 

tatggtgaag agatggagee ggcccagcac tgagcttgtg cggcttgggt ctgattggtc 3194 

acagattcct cgtgtgtcct ccgcgtgtct gggggctcct ctcccccgcc tcagcctccc 3254 

cccgcatccc cccaagaaag gaaaattatt tttcgtattg taaactttaa acatgaaaaa 3314 

gctgttttta atttaaaaaa aaaaaa 3340 
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<210> 21 
<211> 2547 
<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 

<222> (143) . . (2275) 
<400> 21 

aaggatcctt aattaaatta atcccccccc cccccccgcc cccgcggtgg cggcggcggc 60 

ggcggtggct gccgtggcgg ctgagagtcc agagccggac gttccggccg cttcgggctg 120 

gcggctggag agcgctcggg tc atg tct gcc cag ggg gac tgc gag ttc ctg 172 

Met Ser Ala Gin Gly Asp Cys Glu Phe Leu 
15 10 

gtg cag cga gcc egg gag ttg gtg ccg caa gac ctg tgg gca gcc aag 220 
Val Gin Arg Ala Arg Glu Leu Val Pro Gin Asp Leu Trp Ala Ala Lys 
15 20 25 

gcg tgg ctg ate acg gcc cgc age etc tac ccg gca gac ttt aac ate 
Ala Trp Leu He Thr Ala Arg Ser Leu Tyr Pro Ala Asp Phe Asn He 
30 35 40 

cag tat gag atg tac acc ate gag egg aat gca gag egg ace gcc ace 316 
Gin Tyr Glu Met Tyr Thr He Glu Arg Asn Ala Glu Arg Thr Ala Thr 
45 50 55 

gcc ggg agg ctg ctg tac gac atg ttt gtg aat ttc cca gac cag ccg 3 64 

Ala Gly Arg Leu Leu Tyr Asp Met Phe Val Asn Phe Pro Asp Gin Pro 
60 65 70 

gtg gtg tgg aga gaa ate age att att aca tea gca tta agg aac gat 412 
Val Val Trp Arg Glu He Ser He He Thr Ser Ala Leu Arg Asn Asp 
75 80 85 90 



tea cag gac aaa caa acc caa ttt tta aga agt tta ttt gaa act ctt 
Ser Gin Asp Lys Gin Thr Gin Phe Leu Arg Ser Leu Phe Glu Thr Leu 
95 100 105 

cct ggt egg gtc cag tgt gaa atg tta eta aag gtc acg gaa caa tgc 
Pro Gly Arg Val Gin Cys Glu Met Leu Leu Lys Val Thr Glu Gin Cys 
110 115 120 



ttc cct gaa acg gtg gtg cag cat ggg gtt ggc ctt ggg gag gca eta 
Phe Pro Glu Thr Val Val Gin His Gly Val Gly Leu Gly Glu Ala Leu 
140 145 150 

tta gag get gaa act att gaa gaa caa gaa tct cca gtg aac tgc ttt 



268 



460 



508 



ttc aac acg tta gaa cga tea gaa atg ttg ctt eta ctt ttg agg cgc 556 
Phe Asn Thr Leu Glu Arg Ser Glu Met Leu Leu Leu Leu Leu Arg Arg 
125 130 135 



604 



652 



63 



Leu Glu Ala Glu Thr He Glu Glu Gin Glu Ser Pro Val Asn Cys Phe 
155 160 165 170 



aga aaa tta ttt gtt tgt gat gtc ctt cct eta ata att aac aac cat 700 
Arg Lys Leu Phe Val Cys Asp Val Leu Pro Leu He He Asn Asn His 
175 180 185 

gat gtt cga tta cct gec aat tta ttg tat aag tac ttg aac aaa gca 748 
Asp Val Arg Leu Pro Ala Asn Leu Leu Tyr Lys Tyr Leu Asn Lys Ala 
190 195 200 

get gaa ttt tat ate aat tat gtc act agg tct act caa ata gaa aat 796 
Ala Glu Phe Tyr He Asn Tyr Val Thr Arg Ser Thr Gin He Glu Asn 
205 210 215 

cag cat caa ggc gec cag gat aca tct gat tta atg tea cct age aaa 844 
Gin His Gin Gly Ala Gin Asp Thr Ser Asp Leu Met Ser Pro Ser Lys 
220 225 230 

cgt age tct cag aag tac ata ata gaa ggg ctg acg gaa aaa tea tec 8 92 

Arg Ser Ser Gin Lys Tyr He He Glu Gly Leu Thr Glu Lys Ser Ser 
235 240 245 250 

cag ate gtg gac cct tgg gag agg ttg ttt aag att ttg aat gtt gtt 94 0 

Gin He Val Asp Pro Trp Glu Arg Leu Phe Lys He Leu Asn Val Val 
255 260 265 

gga atg aga tgt gaa tgg cag atg gat aaa gga aga cga age tat gga 988 
Gly Met Arg Cys Glu Trp Gin Met Asp Lys Gly Arg Arg Ser Tyr Gly 
270 275 280 

gat att ttg cat aga atg aag gat etc tgc aga tac atg aac aac ttt 1036 
Asp He Leu His Arg Met Lys Asp Leu Cys Arg Tyr Met Asn Asn Phe 
285 290 295 

gat agt gaa gca cat gca aaa tat aaa aac caa gtg gtg tat tec ace 1084 
Asp Ser Glu Ala His Ala Lys Tyr Lys Asn Gin Val Val Tyr Ser Thr 
300 305 310 

atg ctg gtc ttc ttt aag aat gca ttc cag tat gtc aac age ata cag 1132 
Met Leu Val Phe Phe Lys Asn Ala Phe Gin Tyr Val Asn Ser He Gin 
315 320 325 330 

cca tct etc ttc caa ggt cct aat gee ccg age caa gtt cca ctg gtt 1180 
Pro Ser Leu Phe Gin Gly Pro Asn Ala Pro Ser Gin Val Pro Leu Val 
335 340 345 

ctt ctt gaa gat gta teg aat gtg tat ggt gat gta gaa att gat cgt 1228 
Leu Leu Glu Asp Val Ser Asn Val Tyr Gly Asp Val Glu He Asp Arg 
350 355 360 

aat aaa cac ate cat aaa aag agg aaa eta get gaa gga aga gaa aaa 12 76 

Asn Lys His He His Lys Lys Arg Lys Leu Ala Glu Gly Arg Glu Lys 
365 370 375 

ace atg agt tea gac gat gaa gac tgt teg gcg aaa gga aga aat cgt 1324 
Thr Met Ser Ser Asp Asp Glu Asp Cys Ser Ala Lys Gly Arg Asn Arg 



64 



380 385 390 

cac att gta gtc aat aaa gcc gaa ctt get aac tec act gaa gtg tta 1372 
His lie Val Val Asn Lys Ala Glu Leu Ala Asn Ser Thr Glu Val Leu 
395 400 405 410 

gaa age ttt aaa ttg gcc agg gag age tgg gag ttg etc tat tec eta 1420 
Glu Ser Phe Lys Leu Ala Arg Glu Ser Trp Glu Leu Leu Tyr Ser Leu 
415 420 425 



gaa ttc ctt gac aaa gaa ttt aca agg att tgc ttg gcc tgg aag acg 
Glu Phe Leu Asp Lys Glu Phe Thr Arg He Cys Leu Ala Trp Lys Thr 
430 435 440 



gag cat cag agg gcg etc ate cag ctg gcg acg tgc cac ttt gcg eta 
Glu His Gin Arg Ala Leu He Gin Leu Ala Thr Cys His Phe Ala Leu 
495 500 505 

ggg gag tac aga atg aca tgt gaa aaa gtc ctt gat ttg atg tgc tac 
Gly Glu Tyr Arg Met Thr Cys Glu Lys Val Leu Asp Leu Met Cys Tyr 
510 515 520 



ctt ttc ctg aaa get gtc aat aaa att tgc caa caa gga aat ttc caa 
Leu Phe Leu Lys Ala Val Asn Lys He Cys Gin Gin Gly Asn Phe Gin 
605 610 615 



1468 



gat act tgg ctt tgg tta aga ate ttc etc act gat atg ate ate tat 1516 
Asp Thr Trp Leu Trp Leu Arg He Phe Leu Thr Asp Met He He Tyr 
445 450 455 

cag ggt caa tat aaa aag gcg ata gcc age ctg cat cac tta gca get 1564 
Gin Gly Gin Tyr Lys Lys Ala He Ala Ser Leu His His Leu Ala Ala 
460 465 470 

etc cag gga tec att tct cag cca cag ate aca ggg cag ggg acc ctg 1612 
Leu Gin Gly Ser He Ser Gin Pro Gin He Thr Gly Gin Gly Thr Leu 
475 480 485 490 



1660 



1708 



atg gta etc ccc att caa gat gga ggc aaa tec cag gag gaa ccc teg 1756 

Met Val Leu Pro He Gin Asp Gly Gly Lys Ser Gin Glu Glu Pro Ser 
525 530 535 

aaa gta aag ccc aaa ttt aga aaa ggt teg gat ctg aag etc ctg cct 1804 

Lys Val Lys Pro Lys Phe Arg Lys Gly Ser Asp Leu Lys Leu Leu Pro 
540 545 550 

tgt acc age aag get ate atg cca tac tgc etc cat tta atg tta gcc 1852 

Cys Thr Ser Lys Ala He Met Pro Tyr Cys Leu His Leu Met Leu Ala 

555 560 565 570 

tgt ttt aag ctt aga get ttc aca gac aac aga gac gac atg gca ttg 1900 

Cys Phe Lys Leu Arg Ala Phe Thr Asp Asn Arg Asp Asp Met Ala Leu 
575 580 585 

ggg cat gtg att gtg ttg ctt cag caa gag tgg cca egg ggc gag aat 194 8 

Gly His Val He Val Leu Leu Gin Gin Glu Trp Pro Arg Gly Glu Asn 
590 595 600 



1996 



65 



tat gag aat ttt ttc aat tac gtt aca aat att gat atg ctg gag gaa 
Tyr Glu Asn Phe Phe Asn Tyr Val Thr Asn lie Asp Met Leu Glu Glu 
620 625 630 

ttt gcc tac ttg aga act cag gaa ggt ggg aaa att cat ctg gaa tta 
Phe Ala Tyr Leu Arg Thr Gin Glu Gly Gly Lys He His Leu Glu Leu 
635 640 645 650 

eta ccc aat caa gga atg ctg ate aag cac cac act gta act cga ggc 
Leu Pro Asn Gin Gly Met Leu He Lys His His Thr Val Thr Arg Gly 
655 660 665 

ate acc aaa ggc gtg aag gag gac ttt cgc ctg gcc atg gag cgc cag 
He Thr Lys Gly Val Lys Glu Asp Phe Arg Leu Ala Met Glu Arg Gin 
670 675 680 



2044 



2092 



2140 



2188 



gtc tec cgc tgt gga gag aat ctg atg gtg gtt ctg cac agg ttc tgc 2236 
Val Ser Arg Cys Gly Glu Asn Leu Met Val Val Leu His Arg Phe Cys 
685 690 695 

att aat gag aag ate ttg etc ctt cag act ctg acc tga gtggagacct 2285 
He Asn Glu Lys He Leu Leu Leu Gin Thr Leu Thr * 
700 705 710 

ttccaccaga cacagctcgg gcctgtgtaa ttgtaggaga agacactcag cagtgattgc 2345 

catggcacag agccgtggtc attgttgctg ttacaaagaa gaaaaccatc tgagttctaa 2405 

ctccttggtt gcttaaaagt agttcccaag agtctgagaa gctatttcta tttttaagag 2465 

tcattttttg taatttttgt aaaacaaaag taccaatctg ttttgtaaat aaaaatcatc 2525 

ctaaaatttg aaaaaaaaaa aa 2547 



<210> 22 

<211> 2625 

<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 

<222> (143) . . (2353) 
<400> 22 

aaggatcctt aattaaatta atcccccccc cccccccgcc cccgcggtgg cggcggcggc 

ggcggtggct gccgtggcgg ctgagagtcc agagceggae gttccggccg ettegggctg 

gcggctggag agegcteggg tc atg tct gcc cag ggg gac tgc gag ttc ctg 

Met Ser Ala Gin Gly Asp Cys Glu Phe Leu 
15 10 

gtg cag cga gcc egg gag ttg gtg ccg caa gac ctg tgg gca gcc aag 



66 



Val Gin Arg Ala Arg Glu Leu Val Pro Gin Asp Leu Trp Ala Ala Lys 
15 20 25 



gcg tgg ctg ate acg gec cgc age etc tac ccg gca gac ttt aac ate 
Ala Trp Leu He Thr Ala Arg Ser Leu Tyr Pro Ala Asp Phe Asn He 
30 35 40 



268 



cag tat gag atg tac acc ate gag egg aat gca gag egg acc gee acc 316 
Gin Tyr Glu Met Tyr Thr He Glu Arg Asn Ala Glu Arg Thr Ala Thr 
45 50 55 

gee ggg agg ctg ctg tac gac atg ttt gtg aat ttc cca gac cag ccg 3 64 

Ala Gly Arg Leu Leu Tyr Asp Met Phe Val Asn Phe Pro Asp Gin Pro 
60 65 70 

gtg gtg tgg aga gaa ate age att att aca tea gca tta agg aac gat 412 
Val Val Trp Arg Glu He Ser He He Thr Ser Ala Leu Arg Asn Asp 
75 80 85 90 



tea cag gac aaa caa acc caa ttt tta aga agt tta ttt gaa act ctt 
Ser Gin Asp Lys Gin Thr Gin Phe Leu Arg Ser Leu Phe Glu Thr Leu 
95 100 105 

cct ggt egg gtc cag tgt gaa atg tta eta aag gtc acg gaa caa tgc 
Pro Gly Arg Val Gin Cys Glu Met Leu Leu Lys Val Thr Glu Gin Cys 
110 115 120 



ttc cct gaa acg gtg gtg cag cat ggg gtt ggc ctt ggg gag gca eta 
Phe Pro Glu Thr Val Val Gin His Gly Val Gly Leu Gly Glu Ala Leu 
140 145 150 

tta gag get gaa act att gaa gaa caa gaa tct cca gtg aac tgc ttt 
Leu Glu Ala Glu Thr He Glu Glu Gin Glu Ser Pro Val Asn Cys Phe 
155 160 165 170 

aga aaa tta ttt gtt tgt gat gtc ctt cct eta ata att aac aac cat 
Arg Lys Leu Phe Val Cys Asp Val Leu Pro Leu He He Asn Asn His 
175 180 185 

gat gtt cga tta cct gec aat tta ttg tat aag tac ttg aac aaa gca 
Asp Val Arg Leu Pro Ala Asn Leu Leu Tyr Lys Tyr Leu Asn Lys Ala 
190 195 200 

get gaa ttt tat ate aat tat gtc act agg tct act caa ata gaa aat 
Ala Glu Phe Tyr He Asn Tyr Val Thr Arg Ser Thr Gin He Glu Asn 
205 210 215 

cag cat caa ggc gee cag gat aca tct gat tta atg tea cct age aaa 
Gin His Gin Gly Ala Gin Asp Thr Ser Asp Leu Met Ser Pro Ser Lys 
220 225 230 

cgt age tct cag aag tac ata ata gaa ggg ctg acg gaa aaa tea tec 
Arg Ser Ser Gin Lys Tyr He He Glu Gly Leu Thr Glu Lys Ser Ser 



460 



508 



ttc aac acg tta gaa cga tea gaa atg ttg ctt eta ctt ttg agg cgc 556 
Phe Asn Thr Leu Glu Arg Ser Glu Met Leu Leu Leu Leu Leu Arg Arg 
125 130 135 



604 



652 



700 



748 



796 



844 



892 



67 



235 



240 



245 



250 



cag ate gtg gac cct tgg gag agg ttg ttt aag att ttg aat gtt gtt 94 0 

Gin lie Val Asp Pro Trp Glu Arg Leu Phe Lys lie Leu Asn Val Val 
255 260 265 

gga atg aga tgt gaa tgg cag atg gat aaa gga aga cga age tat gga 
Gly Met Arg Cys Glu Trp Gin Met Asp Lys Gly Arg Arg Ser Tyr Gly 
270 275 280 

gat att ttg cat aga atg aag gat etc tgc aga tac atg aac aac ttt 
Asp He Leu His Arg Met Lys Asp Leu Cys Arg Tyr Met Asn Asn Phe 
285 290 295 

gat agt gaa gca cat gca aaa tat aaa aac caa gtg gtg tat tec ace 
Asp Ser Glu Ala His Ala Lys Tyr Lys Asn Gin Val Val Tyr Ser Thr 
300 305 310 

atg ctg gtc ttc ttt aag aat gca ttc cag tat gtc aac age ata cag 1132 
Met Leu Val Phe Phe Lys Asn Ala Phe Gin Tyr Val Asn Ser He Gin 
315 320 325 330 

cca tct etc ttc caa ggt cct aat gee ccg age caa gtt cca ctg gtt 1180 
Pro Ser Leu Phe Gin Gly Pro Asn Ala Pro Ser Gin Val Pro Leu Val 
335 340 345 



ctt ctt gaa gat gta teg aat gtg tat ggt gat gta gaa att gat cgt 
Leu Leu Glu Asp Val Ser Asn Val Tyr Gly Asp Val Glu He Asp Arg 
350 355 360 



988 



1036 



1084 



1228 



aat aaa cac ate cat aaa aag agg aaa eta get gaa gga aga gaa aaa 12 76 

Asn Lys His He His Lys Lys Arg Lys Leu Ala Glu Gly Arg Glu Lys 
365 370 375 

ace atg agt tea gac gat gaa gac tgt teg gcg aaa gga aga aat cgt 1324 
Thr Met Ser Ser Asp Asp Glu Asp Cys Ser Ala Lys Gly Arg Asn Arg 
380 385 390 

cac att gta gtc aat aaa gee gaa ctt get aac tec act gaa gtg tta 1372 
His He Val Val Asn Lys Ala Glu Leu Ala Asn Ser Thr Glu Val Leu 
395 400 405 410 

gaa age ttt aaa ttg gee agg gag age tgg gag ttg etc tat tec eta 142 0 

Glu Ser Phe Lys Leu Ala Arg Glu Ser Trp Glu Leu Leu Tyr Ser Leu 
415 420 425 

gaa ttc ctt gac aaa gaa ttt aca agg att tgc ttg gee tgg aag acg 1468 
Glu Phe Leu Asp Lys Glu Phe Thr Arg He Cys Leu Ala Trp Lys Thr 
430 435 440 

gat act tgg ctt tgg tta aga ate ttc etc act gat atg ate ate tat 1516 
Asp Thr Trp Leu Trp Leu Arg He Phe Leu Thr Asp Met He He Tyr 
445 450 455 

cag ggt caa tat aaa aag gcg ata gee age ctg cat cac tta gca get 1564 
Gin Gly Gin Tyr Lys Lys Ala He Ala Ser Leu His His Leu Ala Ala 
460 465 470 



68 



etc cag gga tec att tct cag cca cag ate aca ggg cag ggg acc ctg 1612 
Leu Gin Gly Ser He Ser Gin Pro Gin He Thr Gly Gin Gly Thr Leu 
475 480 485 490 



ctt ttc ctg aaa get gtc aat aaa att tgc caa caa gga aat ttc caa 
Leu Phe Leu Lys Ala Val Asn Lys He Cys Gin Gin Gly Asn Phe Gin 
605 610 615 

tat gag aat ttt ttc aat tac gtt aca aat att gat atg ctg gag gaa 
Tyr Glu Asn Phe Phe Asn Tyr Val Thr Asn He Asp Met Leu Glu Glu 
620 625 630 

ttt gee tac ttg aga act cag gaa ggt ggg aaa att cat ctg gaa tta 
Phe Ala Tyr Leu Arg Thr Gin Glu Gly Gly Lys He His Leu Glu Leu 
635 640 645 650 

eta ccc aat caa gga atg ctg ate aag cct tct age cct ccc atg ggg 
Leu Pro Asn Gin Gly Met Leu He Lys Pro Ser Ser Pro Pro Met Gly 
655 660 665 

tta ctg cag cag gaa ttc tta cct gtg ctt cag ccc age ata cag act 
Leu Leu Gin Gin Glu Phe Leu Pro Val Leu Gin Pro Ser He Gin Thr 
670 675 680 

get gac agg cac cac act gta act cga ggc ate acc aaa ggc gtg aag 
Ala Asp Arg His His Thr Val Thr Arg Gly He Thr Lys Gly Val Lys 
685 690 695 



gag cat cag agg gcg etc ate cag ctg gcg acg tgc cac ttt gcg eta 1660 
Glu His Gin Arg Ala Leu He Gin Leu Ala Thr Cys His Phe Ala Leu 
495 500 505 

ggg gag tac aga atg aca tgt gaa aaa gtc ctt gat ttg atg tgc tac 1708 
Gly Glu Tyr Arg Met Thr Cys Glu Lys Val Leu Asp Leu Met Cys Tyr 
510 515 520 

atg gta etc ccc att caa gat gga ggc aaa tec cag gag gaa ccc teg 1756 
Met Val Leu Pro He Gin Asp Gly Gly Lys Ser Gin Glu Glu Pro Ser 
525 530 535 

aaa gta aag ccc aaa ttt aga aaa ggt teg gat ctg aag etc ctg cct 1804 
Lys Val Lys Pro Lys Phe Arg Lys Gly Ser Asp Leu Lys Leu Leu Pro 
540 545 550 

tgt acc age aag get ate atg cca tac tgc etc cat tta atg tta gee 1852 
Cys Thr Ser Lys Ala He Met Pro Tyr Cys Leu His Leu Met Leu Ala 
555 560 565 570 

tgt ttt aag ctt aga get ttc aca gac aac aga gac gac atg gca ttg 1900 
Cys Phe Lys Leu Arg Ala Phe Thr Asp Asn Arg Asp Asp Met Ala Leu 
575 580 585 

ggg cat gtg att gtg ttg ctt cag caa gag tgg cca egg ggc gag aat 194 8 

Gly His Val He Val Leu Leu Gin Gin Glu Trp Pro Arg Gly Glu Asn 
590 595 600 



1996 



2044 



2092 



2140 



2188 



2236 



69 



gag gac ttt cgc ctg gcc atg gag cgc cag gtc tec cgc tgt gga gag 2284 
Glu Asp Phe Arg Leu Ala Met Glu Arg Gin Val Ser Arg Cys Gly Glu 
700 705 710 

aat ctg atg gtg gtt ctg cac agg ttc tgc att aat gag aag ate ttg 2332 
Asn Leu Met Val Val Leu His Arg Phe Cys He Asn Glu Lys He Leu 
715 720 725 730 

etc ctt cag act ctg ace tga gt ggagaccttt ccaccagaca cagctcgggc 2385 
Leu Leu Gin Thr Leu Thr * 
735 



ctgtgtaatt 


gtaggagaag 


acactcagca 


gtgattgeca 


tggcacagag ccgtggtcat 


2445 


tgttgctgtt 


acaaagaaga 


aaaccatctg 


agttctaact 


ccttggttgc ttaaaagtag 


2505 


ttcccaagag 


tctgagaagc 


tatttctatt 


tttaagagtc 


attttttgta atttttgtaa 


2565 


aacaaaagta 


ccaatctgtt 


ttgtaaataa 


aaatcatcct 


aaaatttgaa aaaaaaaaaa 


2625 



<210> 23 

<211> 6288 

<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 

<222> (115) . . (5253) 
<400> 23 

aagggagece cgctcagcgc ggggagegee cggccccctc cccgccccat gcgcccgcgg 



60 



ctctgaagcc tgagegggge egggggctgg geggggcegg ggcccgccgt aggc atg 117 

Met 
l 

gcg tec ggg age egg tgg egg ccg aca ccg ccg ccg ctg ctg ttg ctg 165 
Ala Ser Gly Ser Arg Trp Arg Pro Thr Pro Pro Pro Leu Leu Leu Leu 
5 10 15 

ctg ctg ctg gcg ctg gcg gcg cgc gcg gac ggc ctg gag ttc ggc ggc 213 
Leu Leu Leu Ala Leu Ala Ala Arg Ala Asp Gly Leu Glu Phe Gly Gly 
20 25 30 

ggc ccc ggg cag tgg get cgc tac gcg cgc tgg gcg ggc gcg gcg age 2 61 

Gly Pro Gly Gin Trp Ala Arg Tyr Ala Arg Trp Ala Gly Ala Ala Ser 
35 40 45 

age ggc gag etc age ttc age ctg cgc ace aac gcc acg cgc gcg ctg 3 09 

Ser Gly Glu Leu Ser Phe Ser Leu Arg Thr Asn Ala Thr Arg Ala Leu 
50 55 60 65 

ctg etc tac ctg gac gac ggc ggc gac tgc gac ttc ctg gag ctg ctg 357 
Leu Leu Tyr Leu Asp Asp Gly Gly Asp Cys Asp Phe Leu Glu Leu Leu 



70 



70 



75 



80 



ctg gtg gac ggc cgc ctg egg ctg cgc ttc acg ctt teg tgc gec gag 405 
Leu Val Asp Gly Arg Leu Arg Leu Arg Phe Thr Leu Ser Cys Ala Glu 
85 90 95 

ccg gec acg ctg cag ctg gac acg ccg gtg gec gac gac cgc tgg cac 453 
Pro Ala Thr Leu Gin Leu Asp Thr Pro Val Ala Asp Asp Arg Trp His 
100 105 110 

atg gtg ctg ctg acc cgc gac gcg cgc cgc acg gcg ctg gcg gtg gac 501 
Met Val Leu Leu Thr Arg Asp Ala Arg Arg Thr Ala Leu Ala Val Asp 
115 120 125 

ggc gag gec cgc gec gec gag gtg cgc tec aag egg cgc gag atg cag 54 9 

Gly Glu Ala Arg Ala Ala Glu Val Arg Ser Lys Arg Arg Glu Met Gin 
130 135 140 145 

gtg gec age gac ctg ttc gtg ggc ggc ate ccg ccc gac gtg cgc etc 597 
Val Ala Ser Asp Leu Phe Val Gly Gly He Pro Pro Asp Val Arg Leu 
150 155 160 

teg gcg ctt acg ctg age acc gtc aag tac gag ccg ccc ttc cgc ggt 
Ser Ala Leu Thr Leu Ser Thr Val Lys Tyr Glu Pro Pro Phe Arg Gly 
165 170 175 

etc ttg gec aac ctg aag ctg ggc gag egg ccc ccc gcg ctg ctg ggc 
Leu Leu Ala Asn Leu Lys Leu Gly Glu Arg Pro Pro Ala Leu Leu Gly 
180 185 190 



645 



693 



age cag ggc ctg cgc ggc gec acc gec gac ccg ctg tgc gcg ccc gcg 741 
Ser Gin Gly Leu Arg Gly Ala Thr Ala Asp Pro Leu Cys Ala Pro Ala 
195 200 205 

cgc aac ccc tgc gec aac ggc ggc etc tgc acc gtg ctg gec ccc ggc 789 
Arg Asn Pro Cys Ala Asn Gly Gly Leu Cys Thr Val Leu Ala Pro Gly 
210 215 220 225 

gag gtg ggc tgc gac tgc age cac acg ggc ttc ggc ggc aag ttc tgc 837 
Glu Val Gly Cys Asp Cys Ser His Thr Gly Phe Gly Gly Lys Phe Cys 
230 235 240 

age gaa gag gag cac ccc atg gaa ggt ccg get cac ctg acg tta aac 885 
Ser Glu Glu Glu His Pro Met Glu Gly Pro Ala His Leu Thr Leu Asn 
245 250 255 

age gaa gta ggg tec tta ctg ttc tec gag ggg ggg gee ggg aga gga 933 
Ser Glu Val Gly Ser Leu Leu Phe Ser Glu Gly Gly Ala Gly Arg Gly 
260 265 270 

gga gec ggc gat gtg cac cag cca aca aaa ggc aag gag gag ttt gtg 981 
Gly Ala Gly Asp Val His Gin Pro Thr Lys Gly Lys Glu Glu Phe Val 
275 280 285 

gcg acc ttc aaa ggc aat gag ttc ttc tgc tac gac ctg tea cac aac 102 9 

Ala Thr Phe Lys Gly Asn Glu Phe Phe Cys Tyr Asp Leu Ser His Asn 
290 295 300 305 



71 



ccc ate cag age age act gat gag ate aca ctg gec ttc cgc acc ctg 1077 
Pro lie Gin Ser Ser Thr Asp Glu He Thr Leu Ala Phe Arg Thr Leu 
310 315 320 

caa cgc aac ggc ctg atg ctg cat aca ggc aag teg gec gac tac gtc 1125 
Gin Arg Asn Gly Leu Met Leu His Thr Gly Lys Ser Ala Asp Tyr Val 
325 330 335 

aac ctg tec etc aag tct ggg get gtc tgg ctg gtc ate aac eta ggc 1173 
Asn Leu Ser Leu Lys Ser Gly Ala Val Trp Leu Val He Asn Leu Gly 
340 345 350 

tea ggt gec ttc gag gee ctt gtg gaa ccc gtc aat ggc aag ttc aac 1221 
Ser Gly Ala Phe Glu Ala Leu Val Glu Pro Val Asn Gly Lys Phe Asn 
355 360 365 

gac aac gee tgg cac gac gtc egg gtc acc cga aac ctg cgc cag cac 1269 
Asp Asn Ala Trp His Asp Val Arg Val Thr Arg Asn Leu Arg Gin His 
370 375 380 385 

gca ggg att gga cac get atg gta aac aaa ctg cat tat ctg gtg acc 1317 
Ala Gly He Gly His Ala Met Val Asn Lys Leu His Tyr Leu Val Thr 
390 395 400 

ate teg gtg gac ggg ate ctg acc acc aca ggc tac acg cag gag gat 13 65 

He Ser Val Asp Gly He Leu Thr Thr Thr Gly Tyr Thr Gin Glu Asp 
405 410 415 

tac acc atg ctg ggc tct gat gac ttc ttc tac att ggg ggc age ccc 1413 
Tyr Thr Met Leu Gly Ser Asp Asp Phe Phe Tyr He Gly Gly Ser Pro 
420 425 430 

aac aca get gac ctg ccg ggc teg ccc gtc age aac aac ttc atg ggc 1461 
Asn Thr Ala Asp Leu Pro Gly Ser Pro Val Ser Asn Asn Phe Met Gly 
435 440 445 

tgc etc aag gac gtg gtc tat aag aac aat gac ttc aaa ttg gaa eta 1509 
Cys Leu Lys Asp Val Val Tyr Lys Asn Asn Asp Phe Lys Leu Glu Leu 
450 455 460 465 

tec cgc ctg gca aag gaa ggg gac ccc aag atg aag ctg cag ggg gac 1557 
Ser Arg Leu Ala Lys Glu Gly Asp Pro Lys Met Lys Leu Gin Gly Asp 
470 475 480 

ttg tea ttc cgc tgt gag gat gtg get gec ctg gac cct gtg acc ttt 1605 
Leu Ser Phe Arg Cys Glu Asp Val Ala Ala Leu Asp Pro Val Thr Phe 
485 490 495 

gag agt ccc gag gec ttt gtg gcg ctg ccc cgc tgg age get aag cgc 1653 
Glu Ser Pro Glu Ala Phe Val Ala Leu Pro Arg Trp Ser Ala Lys Arg 
500 505 510 

act ggc tec ate tec eta gac ttc cgc acc acc gag ccc aat ggg ctg 1701 
Thr Gly Ser He Ser Leu Asp Phe Arg Thr Thr Glu Pro Asn Gly Leu 
515 520 525 



72 



ctg etc ttc 
Leu Leu Phe 
530 

age tct get 
Ser Ser Ala 



cac etc tat 
His Leu Tyr 



gca tec age 
Ala Ser Ser 
580 

cag agg gat 
Gin Arg Asp 
595 

ccg ttc ttg 
Pro Phe Leu 
610 

ctg tac ctg 
Leu Tyr Leu 



ccc cca gag 
Pro Pro Glu 



gtg egg gac 
Val Arg Asp 
660 

get gag get 
Ala Glu Ala 
675 

acg ctg aag 
Thr Leu Lys 
690 

cga gaa ggc 
Arg Glu Gly 



ctt ggg egg 
Leu Gly Arg 



tec atg tac 
Ser Met Tyr 
740 

gag gat gtg 



age cag ggc 
Ser Gin Gly 
535 

cag egg gee 
Gin Arg Ala 
550 

ctt ctg ctg 
Leu Leu Leu 
565 

cgc aag gtc 
Arg Lys Val 



ggg cga aaa 
Gly Arg Lys 



gee act gga 
Ala Thr Gly 
615 

ggc ggt etc 
Gly Gly Leu 
630 

gtg tgg aca 
Val Trp Thr 
645 

etc ttc ata 
Leu Phe lie 



cag ggg get 
Gin Gly Ala 



cag tgt gca 
Gin Cys Ala 
695 

tgg aac cgc 
Trp Asn Arg 
710 

gtc tgt gag 
Val Cys Glu 
725 

atg aag ate 
Met Lys lie 



tec ctg cgt 



egg egg get 
Arg Arg Ala 



gac tac ttt 
Asp Tyr Phe 



gac atg gga 
Asp Met Gly 
570 

aat gat ggc 
Asn Asp Gly 
585 

ggc tec ate 
Gly Ser lie 
600 

gac age gag 
Asp Ser Glu 



cct gag ggg 
Pro Glu Gly 



gca gca etc 
Ala Ala Leu 
650 

gat ggg cgt 
Asp Gly Arg 
665 

gtg ggc gtt 
Val Gly Val 
680 

tct gee ccc 
Ser Ala Pro 



ttc ate tgt 
Phe lie Cys 



aga gag gee 
Arg Glu Ala 
730 

atg ctg cct 
Met Leu Pro 
745 

ttc atg tec 



ggg ggt gga 

Gly Gly Gly 
540 

gee atg gag 
Ala Met Glu 
555 

tct ggg ggc 
Ser Gly Gly 



gag tgg tgt 
Glu Trp Cys 



tea gtg aat 
Ser Val Asn 
605 

att ctg gac 
lie Leu Asp 
620 

ggc egg gtg 
Gly Arg Val 
635 

egg gca ggc 
Arg Ala Gly 



age cga gac 
Ser Arg Asp 



gee ccc ttt 
Ala Pro Phe 
685 

tgt cgc aat 
Cys Arg Asn 
700 

gac tgc ate 
Asp Cys lie 
715 

acg gtc ctg 
Thr Val Leu 



aac gee atg 
Asn Ala Met 



cag egg gec 



get ggc age 
Ala Gly Ser 



eta ttg gac 
Leu Leu Asp 
560 

ate aag ctg 
lie Lys Leu 
575 

cac gtg gac 
His Val Asp 
590 

agt cgc age 
Ser Arg Ser 



ctg gag agt 
Leu Glu Ser 



gac ctg ccc 
Asp Leu Pro 
640 

tac gtg ggc 
Tyr Val Gly 
655 

etc egg ggc 
Leu Arg Gly 
670 

tgc tec egg 
Cys Ser Arg 



ggg ggc gtc 
Gly Gly Val 



ggg ace ggc 
Gly Thr Gly 
720 

age tac gat 
Ser Tyr Asp 
735 

cac acg gag 
His Thr Glu 
750 

tac gga etc 



cac 1749 

His 

545 

ggc 1797 
Gly 



egg 1845 
Arg 



ttc 1893 
Phe 



acg 1941 
Thr 



gag 1989 

Glu 

625 

ctg 2037 
Leu 



tgt 2085 
Cys 



ctg 2133 
Leu 



gag 2181 
Glu 



tgt 2229 

Cys 

705 

ttt 2277 
Phe 



ggc 2325 
Gly 



gca 2373 
Ala 



atg 2421 
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Glu Asp Val Ser Leu Arg Phe Met Ser Gin Arg Ala Tyr Gly Leu Met 
755 760 765 



atg gcc acc act tec agg gag tct gec gac acc eta cgc ctg gag ctg 2469 
Met Ala Thr Thr Ser Arg Glu Ser Ala Asp Thr Leu Arg Leu Glu Leu 
770 775 780 785 

gat ggg ggg cag atg aag etc act gtc aac etc gac tgc ctg cgc gtc 2 517 

Asp Gly Gly Gin Met Lys Leu Thr Val Asn Leu Asp Cys Leu Arg Val 
790 795 800 

ggc tgc gca ccc agt aaa ggc ccc gaa acg ctg ttt gcg ggg cac aag 2 565 

Gly Cys Ala Pro Ser Lys Gly Pro Glu Thr Leu Phe Ala Gly His Lys 
805 810 815 

etc aat gac aat gag tgg cac acg gtg agg gtg gtc egg cgt ggc aag 2 613 

Leu Asn Asp Asn Glu Trp His Thr Val Arg Val Val Arg Arg Gly Lys 
820 825 830 

age ctg cag ctg tct gtg gac aac gtg act gtg gag gga cag atg gca 2 661 

Ser Leu Gin Leu Ser Val Asp Asn Val Thr Val Glu Gly Gin Met Ala 
835 840 845 

gga gcc cat atg egg ctg gag ttc cac aac att gag acg ggc ate atg 2709 
Gly Ala His Met Arg Leu Glu Phe His Asn lie Glu Thr Gly He Met 
850 855 860 865 

acg gag egg egg ttt ate tec gtg gtg ccc tec aac ttc ate ggg cat 2757 
Thr Glu Arg Arg Phe He Ser Val Val Pro Ser Asn Phe He Gly His 
870 875 880 



ctg agt ggg etc gtg ttc aat ggc cag ccc tac atg gac cag tgc aag 
Leu Ser Gly Leu Val Phe Asn Gly Gin Pro Tyr Met Asp Gin Cys Lys 
885 890 895 



gca etc gcc acg etc caa gcc tat get tec atg cac etc ttc ttc cag 
Ala Leu Ala Thr Leu Gin Ala Tyr Ala Ser Met His Leu Phe Phe Gin 
930 935 940 945 



ggc aat gac ttc att gtc ate gag ctg gtc aag ggg tac ate cac tac 
Gly Asn Asp Phe He Val He Glu Leu Val Lys Gly Tyr He His Tyr 
965 970 975 

gtg ttt gac ctg ggg aat ggc ccg tec ttg atg aag ggg aac tea gac 
Val Phe Asp Leu Gly Asn Gly Pro Ser Leu Met Lys Gly Asn Ser Asp 



2805 



gat ggt gac ate acc tac tgt gag etc aat get cgc ttt ggc ctg cgt 2 853 
Asp Gly Asp He Thr Tyr Cys Glu Leu Asn Ala Arg Phe Gly Leu Arg 
900 905 910 

gcc att gtg gcc gat ccc gtc acc ttc aag agt cgc age age tac ctg 2901 
Ala He Val Ala Asp Pro Val Thr Phe Lys Ser Arg Ser Ser Tyr Leu 
915 920 925 



2949 



ttc aag acc acg gcc cct gat ggg ctt ctt ctg ttc aac teg ggc aac 2 997 
Phe Lys Thr Thr Ala Pro Asp Gly Leu Leu Leu Phe Asn Ser Gly Asn 
950 955 960 



3045 



3093 
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980 985 990 

aaa cca gtc aat gac aac cag tgg cac aac gtg gtg gtg tec agg gac 3141 
Lys Pro Val Asn Asp Asn Gin Trp His Asn Val Val Val Ser Arg Asp 
995 1000 1005 

cca ggc aac gtg cac acg etc aag att gac tec cgc act gtc acg cag 3189 
Pro Gly Asn Val His Thr Leu Lys He Asp Ser Arg Thr Val Thr Gin 
1010 1015 1020 1025 

cac tec aat ggc gec cga aac etc gat etc aaa ggg gag ttg tac att 32 37 

His Ser Asn Gly Ala Arg Asn Leu Asp Leu Lys Gly Glu Leu Tyr He 
1030 1035 1040 

ggc ggt ctg age aag aat atg ttc age aac ctg ccc aag ctg gtg gee 3 2 85 

Gly Gly Leu Ser Lys Asn Met Phe Ser Asn Leu Pro Lys Leu Val Ala 
1045 1050 1055 

tec egg gat ggc ttt cag ggc tgc ctg gee tea gtg gac etc aac gga 3333 
Ser Arg Asp Gly Phe Gin Gly Cys Leu Ala Ser Val Asp Leu Asn Gly 
1060 1065 1070 

cgt etc cca gac etc ate gec gac gee ctg cac cgc att ggg cag gtg 33 81 

Arg Leu Pro Asp Leu He Ala Asp Ala Leu His Arg He Gly Gin Val 
1075 1080 1085 

gag agg ggc tgt gat ggc ccc age ace ace tgc act gaa gag tec tgt 342 9 

Glu Arg Gly Cys Asp Gly Pro Ser Thr Thr Cys Thr Glu Glu Ser Cys 
1090 1095 1100 1105 

gec aac cag ggc gtc tgc ttg cag cag tgg gat ggc ttc ace tgc gac 34 77 

Ala Asn Gin Gly Val Cys Leu Gin Gin Trp Asp Gly Phe Thr Cys Asp 
1110 1115 1120 

tgc ace atg act tec tat gga ggc cct gtc tgc aat gat ccc ggg ace 3525 
Cys Thr Met Thr Ser Tyr Gly Gly Pro Val Cys Asn Asp Pro Gly Thr 
1125 1130 1135 

aca tac ate ttt ggg aag ggg gga gcg etc ate acc tac acg tgg ccc 3573 
Thr Tyr He Phe Gly Lys Gly Gly Ala Leu He Thr Tyr Thr Trp Pro 
1140 1145 1150 

ccc aat gac agg ccc age acg agg atg gat cgc ctg gee gtg ggc ttc 3 621 
Pro Asn Asp Arg Pro Ser Thr Arg Met Asp Arg Leu Ala Val Gly Phe 
1155 1160 1165 

age acc cac cag egg age get gtg ctg gtg egg gtg gac age gec tec 3669 
Ser Thr His Gin Arg Ser Ala Val Leu Val Arg Val Asp Ser Ala Ser 
1170 1175 1180 1185 

ggc ctt gga gac tac ctg cag ctg cac ate gac cag ggc acc gtg ggg 3 717 

Gly Leu Gly Asp Tyr Leu Gin Leu His He Asp Gin Gly Thr Val Gly 
1190 1195 1200 

gtg ate ttt aac gtg ggc acg gac gac att acc ate gac gag ccc aac 3765 
Val He Phe Asn Val Gly Thr Asp Asp He Thr He Asp Glu Pro Asn 
1205 1210 1215 
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gcc ata gta age gac ggc aaa tac cac gtg gtg cgc ttc act cga age 3 813 

Ala lie Val Ser Asp Gly Lys Tyr His Val Val Arg Phe Thr Arg Ser 
1220 1225 1230 

ggc ggc aac gcc acc ctg cag gtg gac age tgg ccg gtc aac gag egg 3 861 

Gly Gly Asn Ala Thr Leu Gin Val Asp Ser Trp Pro Val Asn Glu Arg 
1235 1240 1245 

tac ccg gca gga aac ttt gat aac gag cgc ctg gcg att get aga cag 3909 
Tyr Pro Ala Gly Asn Phe Asp Asn Glu Arg Leu Ala lie Ala Arg Gin 
1250 1255 1260 1265 

aga ate ccc tac egg ctt ggt cga gta gta gat gag tgg ctg etc gac 3 957 
Arg lie Pro Tyr Arg Leu Gly Arg Val Val Asp Glu Trp Leu Leu Asp 
1270 1275 1280 

aaa ggc cgc cag ctg acc ate ttc aac age cag get gcc ate aag ate 4005 
Lys Gly Arg Gin Leu Thr lie Phe Asn Ser Gin Ala Ala lie Lys lie 
1285 1290 1295 

ggg ggc egg gat cag ggc cgc ccc ttc cag ggc cag gtg tec ggc etc 4053 
Gly Gly Arg Asp Gin Gly Arg Pro Phe Gin Gly Gin Val Ser Gly Leu 
1300 1305 1310 

tac tac aat ggg etc aag gtg ctg gcg ctg gcc gcc gag age gac ccc 4101 
Tyr Tyr Asn Gly Leu Lys Val Leu Ala Leu Ala Ala Glu Ser Asp Pro 
1315 1320 1325 

aat gtg egg act gag ggt cac ctg cgc ctg gtg ggg gag ggg ccg tec 4149 
Asn Val Arg Thr Glu Gly His Leu Arg Leu Val Gly Glu Gly Pro Ser 
1330 1335 1340 1345 

gtg ctg etc agt gcg gag acc acg gcc acc acc ctg ctg get gac atg 4197 
Val Leu Leu Ser Ala Glu Thr Thr Ala Thr Thr Leu Leu Ala Asp Met 
1350 1355 1360 

gcc acc acc ate atg gag act acc acc acc atg gcc act acc acc acg 4245 
Ala Thr Thr lie Met Glu Thr Thr Thr Thr Met Ala Thr Thr Thr Thr 
1365 1370 1375 

cgc egg ggc cgc tec ccc aca ctg agg gac age acc acc cag aac aca 42 93 

Arg Arg Gly Arg Ser Pro Thr Leu Arg Asp Ser Thr Thr Gin Asn Thr 
1380 1385 1390 

gat gac ctg ctg gtg gcc tct get gag tgt cca age gat gat gag gac 4341 
Asp Asp Leu Leu Val Ala Ser Ala Glu Cys Pro Ser Asp Asp Glu Asp 
1395 1400 1405 

ctg gag gag tgt gag ccc agt act gga gga gag tta ata ttg ccc att 43 8 9 

Leu Glu Glu Cys Glu Pro Ser Thr Gly Gly Glu Leu lie Leu Pro lie 
1410 1415 1420 1425 

ate acg gag gac tec tta gac ccc cct ccc gtg gcc acc cga tec ccc 4437 
lie Thr Glu Asp Ser Leu Asp Pro Pro Pro Val Ala Thr Arg Ser Pro 
1430 1435 1440 
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ttc gtg ccc ccg ccc cct acc ttc tac ccc ttc etc acg gga gtg ggc 4485 
Phe Val Pro Pro Pro Pro Thr Phe Tyr Pro Phe Leu Thr Gly Val Gly 
1445 1450 1455 

gec acc caa gac acg ctg ccc ccg ccc gec gcg cgc cgc ccg ccc tct 4533 
Ala Thr Gin Asp Thr Leu Pro Pro Pro Ala Ala Arg Arg Pro Pro Ser 
1460 1465 1470 

ggg ggc ccg tgc cag gec gag egg gac gac age gac tgc gag gag ccc 45 81 

Gly Gly Pro Cys Gin Ala Glu Arg Asp Asp Ser Asp Cys Glu Glu Pro 
1475 1480 1485 

ate gag gee teg ggc ttc gec tec ggg gag gtc ttt gac tec age etc 4629 
He Glu Ala Ser Gly Phe Ala Ser Gly Glu Val Phe Asp Ser Ser Leu 
1490 1495 1500 1505 



ccc ccc acg gac gac gag gac ttt tac acc acc ttt ccc ctg gtc acg 
Pro Pro Thr Asp Asp Glu Asp Phe Tyr Thr Thr Phe Pro Leu Val Thr 
1510 1515 1520 



4677 



gac cgc acc acc etc ctg tea ccc cgc aaa ccc get ccc egg ccc aac 4725 
Asp Arg Thr Thr Leu Leu Ser Pro Arg Lys Pro Ala Pro Arg Pro Asn 
1525 1530 1535 

etc agg aca gat ggg gee acg ggc gec cct ggg gtg ctg ttt gec ccc 4773 
Leu Arg Thr Asp Gly Ala Thr Gly Ala Pro Gly Val Leu Phe Ala Pro 
1540 1545 1550 

tec gec ccg gee ccc aac ctg ccg gcg gga aaa atg aac cac cga gac 4821 
Ser Ala Pro Ala Pro Asn Leu Pro Ala Gly Lys Met Asn His Arg Asp 
1555 1560 1565 

ccg ctt cag ccc ttg ctg gag aac ccg ccc ttg ggg ccc ggg gee ccc 4869 
Pro Leu Gin Pro Leu Leu Glu Asn Pro Pro Leu Gly Pro Gly Ala Pro 
1570 1575 1580 1585 

acg tec ttt gag ccg egg agg ccc cct ccc ctg cgc ccc ggc gtg acc 4917 
Thr Ser Phe Glu Pro Arg Arg Pro Pro Pro Leu Arg Pro Gly Val Thr 
1590 1595 1600 

tea gec ccc ggc ttc ccc cat ctg ccc aca gec aac ccc aca ggg cct 4965 
Ser Ala Pro Gly Phe Pro His Leu Pro Thr Ala Asn Pro Thr Gly Pro 
1605 1610 1615 

ggg gag egg ggc ccg ccg ggc gca gtg gag gtg ate egg gag tec age 5013 
Gly Glu Arg Gly Pro Pro Gly Ala Val Glu Val He Arg Glu Ser Ser 
1620 1625 1630 

age acc acg ggc atg gtg gtg ggc att gtg gcg gcg gcg gcg etc tgc 5061 
Ser Thr Thr Gly Met Val Val Gly He Val Ala Ala Ala Ala Leu Cys 
1635 1640 1645 

ate etc ate etc etc tac gec atg tat aag tac cgc aat cgt gat gag 5109 
He Leu He Leu Leu Tyr Ala Met Tyr Lys Tyr Arg Asn Arg Asp Glu 
1650 1655 1660 1665 

ggc tec tac cag gtg gac cag age cga aac tac ate agt aac teg gec 5157 
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Gly Ser Tyr Gin Val Asp Gin Ser Arg Asn Tyr He Ser Asn Ser Ala 
1670 1675 1680 

cag age aat ggg gcg gtg gtg aaa gag aag gec ccg get gec ccc aag 5205 
Gin Ser Asn Gly Ala Val Val Lys Glu Lys Ala Pro Ala Ala Pro Lys 
1685 1690 1695 

acg ccc age aag gec aag aag aac aaa gac aag gag tat tat gtc tga 52 53 
Thr Pro Ser Lys Ala Lys Lys Asn Lys Asp Lys Glu Tyr Tyr Val * 
1700 1705 1710 

gcccccggca ctgcgcccca ctgccagctg cccctcctgg gagggecegg gaggagggtg 5313 

ccaccctctc cctgccaggg gectggggae cctctccctg gctgcctcag gcttctctta 5373 

cgaagaggaa aegcaaaaaa agaaaaggaa aaaccccgtg ctcgccccct tcctcctgcc 5433 

gtccactgcg cggcctcgtc agtcccgggg ctgactgtcc ctctcagctc tgcgcctgcc 5493 

aggcagggca cgtgctcaca gccctgggtt gatttatttt tttaaggggg tagttttatt 5553 

ttggtggggt tgggtgggaa ggaaggctgg gggttttgta aagtgtccac tgctcgtcct 5613 

gttaattttc ctcaattttt cttcttcttc cttctgtccc tcctgccttc cttctcttcc 5673 

caagccctcc aatccccatc ccaggcttgc tgtgtctcac tgtccccacc ctccttccct 5733 

acttcttttt ttgtgtgtct ggtttctccc ttcctttcct ccctttgggt ttccagagtc 5793 

ggtgggagaa gggegggagg gtgggcccga gtggcccagt gggtgggtgg ggtggggtgg 5853 

ggcaagtgcc ccaactcccc tcaccaggag aggcacctgc ttggtgccgc ccagggaagg 5913 

ggctcaggcc tgaeggaagg cctgttctgt gtgtgccgcc gggcgacgtg cattgatggg 5973 

gaagctgctg gaggagcagg ggtggggggt gggagggagg ggaaaggcaa atgeagatat 6033 

atattacaga caaatactct agattccacg agcagcagcc tgtggcaccc gctgggcgcg 6093 

ggcagcaggg aagagggagc aaggcattgt ccacagactg ctggggtcac ttctttgccc 6153 

acgggctccc tgctccccca gttttttttc tctctttgtt aacaaatgtg tctgagtctt 6213 

ggaaaacacc ccaaccccgg aaatgtgtgg gaaaaagaaa acaaaaactt tccaaattcc 6273 

aaaaaaaaaa aaaaa 62 8 8 



<210> 24 

<211> 1016 

<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
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<222> (254) . . (724) 



<400> 24 

ttaatacgat gactataggg aatttggccc tcgagcaaga attcggcacg aggctggctc 60 

tgactcttca cccatcttca cccaggctgg cccctttggt gaaactacaa ctcccagggg 120 

tctgtgcgcg agaaggcagg cgggtttttc taccggaagt ccgctctagc tctgggccct 180 

acaactgcac cctgagccgg agctgcccag tcgccgcggg accggggccg ctggggtctg 240 

gacgggggtc gcc atg ate cgc ttt ate etc ate cag aac egg gca ggc 2 89 

Met lie Arg Phe He Leu He Gin Asn Arg Ala Gly 
15 10 

aag acg cgc ctg gcc aag tgg tac atg cag ttt gat gat gat gag aaa 3 37 

Lys Thr Arg Leu Ala Lys Trp Tyr Met Gin Phe Asp Asp Asp Glu Lys 
15 20 25 

cag aag ctg ate gag gag gtg cat gcc gtg gtc acc gtc cga gac gcc 3 85 

Gin Lys Leu He Glu Glu Val His Ala Val Val Thr Val Arg Asp Ala 
30 35 40 

aaa cac acc aac ttt gtg gag gtc ctg gca age tec gtt get gac age 433 
Lys His Thr Asn Phe Val Glu Val Leu Ala Ser Ser Val Ala Asp Ser 
45 50 55 60 



etc tct gtt ctg cag ttc egg aac ttt aag ate att tac cgc cgc tat 
Leu Ser Val Leu Gin Phe Arg Asn Phe Lys He He Tyr Arg Arg Tyr 
65 70 75 



481 



get ggc etc tac ttc tgc ate tgt gtg gat gtc aat gac aac aac ctg 529 

Ala Gly Leu Tyr Phe Cys He Cys Val Asp Val Asn Asp Asn Asn Leu 
80 85 90 

get tac ctg gag gcc att cac aac ttc gtg gag gtc tta aac gaa tat 577 

Ala Tyr Leu Glu Ala He His Asn Phe Val Glu Val Leu Asn Glu Tyr 
95 100 105 

ttc cac aat gtc tgt gaa ctg gac ctg gtg ttc aac ttc tac aag gtt 625 

Phe His Asn Val Cys Glu Leu Asp Leu Val Phe Asn Phe Tyr Lys Val 
110 115 120 

tac acg gtc gtg gac gag atg ttc ctg get ggc gaa ate cga gag acc 673 

Tyr Thr Val Val Asp Glu Met Phe Leu Ala Gly Glu He Arg Glu Thr 
125 130 135 140 

age cag acg aag gtg ctg aaa cag ctg ctg atg eta cag tec ctg gag 721 
Ser Gin Thr Lys Val Leu Lys Gin Leu Leu Met Leu Gin Ser Leu Glu 

145 150 155 

tga gggc aggcgagccc caccccggcc ccggcccctc ctggactcgc ctgctcgctt 77 8 



ccccttccca ggcccgtggc caacccagca gtccttccct cagctgccta ggaggaaggg 83 8 
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acccagctgg gtctgggcca caagggagga gactgcaccc cactgcctct gggccctggc 
tgtgggcaga ggccaccgtg tgtgtcccga gtaaccgtgc cgttgtcgtg tgatgccata 
agcgtctgtg cgtggagtcc ccaataaacc tgtggtcctg cctggcaaaa aaaaaaaa 



898 
958 
1016 



<210> 25 

<211> 977 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> CDS 

<222> (46) . . (828) 



<400> 25 

agtgagcgct ggtggcggag ttaaagtcaa agcaggagag taatt atg aat age 54 

Met Asn Ser 
1 



gca gcg gga ttc tea cac eta gac cgt cgc gag egg gtt etc aag tta 
Ala Ala Gly Phe Ser His Leu Asp Arg Arg Glu Arg Val Leu Lys Leu 
5 10 15 

999 9^g agt ttc gag aag cag ccg cgc tgc gec ttc cac act gtg cgc 
Gly Glu Ser Phe Glu Lys Gin Pro Arg Cys Ala Phe His Thr Val Arg 
20 25 30 35 



102 



150 



tat gac ttc aaa cct get tct att gac act tct tct gaa gga tac ctt 198 
Tyr Asp Phe Lys Pro Ala Ser He Asp Thr Ser Ser Glu Gly Tyr Leu 
40 45 50 



gag gtt ggt gaa ggt gaa cag gtg ace ata act ctg cca aat ata gaa 24 6 

Glu Val Gly Glu Gly Glu Gin Val Thr He Thr Leu Pro Asn He Glu 
55 60 65 



ggt tea act cca cca gta act gtt ttc aaa ggt tea aaa aaa cct tac 294 
Gly Ser Thr Pro Pro Val Thr Val Phe Lys Gly Ser Lys Lys Pro Tyr 
70 75 80 

tta aaa gaa tgc att ttg att att aac cat gat act gga gaa tgt egg 342 
Leu Lys Glu Cys He Leu He He Asn His Asp Thr Gly Glu Cys Arg 
85 90 95 



eta gaa aaa etc age age aac ate act gta aaa aaa aca aga gtt gaa 390 
Leu Glu Lys Leu Ser Ser Asn He Thr Val Lys Lys Thr Arg Val Glu 
100 105 110 115 



gga age agt aaa att cag tat cgt aaa gaa caa cag caa caa caa atg 43 8 

Gly Ser Ser Lys He Gin Tyr Arg Lys Glu Gin Gin Gin Gin Gin Met 
120 125 130 

tgg aat tea gee agg act ccc aat ctt gta aaa cat tct cca tct gaa 486 
Trp Asn Ser Ala Arg Thr Pro Asn Leu Val Lys His Ser Pro Ser Glu 



80 



135 



140 



145 



gat aag atg tec cca gca tct cca ata gat gat ate gaa aga gaa ctg 534 
Asp Lys Met Ser Pro Ala Ser Pro lie Asp Asp lie Glu Arg Glu Leu 
150 155 160 

aag gca gaa get agt eta atg gac cag atg agt agt tgt gat agt tea 582 
Lys Ala Glu Ala Ser Leu Met Asp Gin Met Ser Ser Cys Asp Ser Ser 
165 170 175 

tea gat tec aaa agt tea tea tct tea agt agt gag gat agt tct agt 630 
Ser Asp Ser Lys Ser Ser Ser Ser Ser Ser Ser Glu Asp Ser Ser Ser 
180 185 190 195 

gac tea gaa gat gaa gat tgc aaa tec tct act tct gat aca ggg aat 678 
Asp Ser Glu Asp Glu Asp Cys Lys Ser Ser Thr Ser Asp Thr Gly Asn 
200 205 210 

tgt gtc tea gga cat cct acc atg aca cag tac agg att cct gat ata 726 
Cys Val Ser Gly His Pro Thr Met Thr Gin Tyr Arg lie Pro Asp lie 
215 220 225 

gat gee agt cat aat aga ttt cga gac aac agt ggc ctt ctg atg aat 774 
Asp Ala Ser His Asn Arg Phe Arg Asp Asn Ser Gly Leu Leu Met Asn 
230 235 240 

act tta aga aat gat ttg cag ctg agt gaa tea gga agt gac agt gat 822 
Thr Leu Arg Asn Asp Leu Gin Leu Ser Glu Ser Gly Ser Asp Ser Asp 
245 250 255 

gac tga agaaatattt agctataaat aaaaatttat acagcatgta taatttattt 878 

Asp * 

260 

tgtattaaca ataaaaattc ctaagactga gggaaatatg tcttaacttt tgatgataaa 938 
agaaattaaa tttgattcag aaatttcaaa aaaaaaaaa 977 



<210> 26 

<211> 3185 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> CDS 

<222> (37) . . (1257) 

<400> 26 

accccggcgg ccctgcccgc ccctccctcc agcatc atg gec age cca aga acc 54 

Met Ala Ser Pro Arg Thr 
1 5 

agg aag gtt ctt aaa gaa gtc agg gtg cag gat gag aac aac gtt tgt 102 
Arg Lys Val Leu Lys Glu Val Arg Val Gin Asp Glu Asn Asn Val Cys 



81 



10 



15 



20 



ttt gag tgt 
Phe Glu Cys 
25 

ate tgg ate 
lie Trp lie 
40 

cac etc age 
His Leu Ser 
55 

gag ctt gag 
Glu Leu Glu 



ctg gag tct 
Leu Glu Ser 



tac aac age 
Tyr Asn Ser 
105 

gec gaa ggc 
Ala Glu Gly 
120 

acc cca cct 
Thr Pro Pro 
135 

ggc cag ccg 
Gly Gin Pro 



tgg ctg aat 
Trp Leu Asn 



tac gtg ggg 
Tyr Val Gly 
185 

ttc etc aac 
Phe Leu Asn 
200 

acc act gga 
Thr Thr Gly 
215 

aag ttt gga 
Lys Phe Gly 



ggc gcg ttc 
Gly Ala Phe 



tgc ctg gag 
Cys Leu Glu 



ttt gtg cgc 
Phe Val Arg 
60 

aag atg aaa 
Lys Met Lys 
75 

cag gag gat 
Gin Glu Asp 
90 

aga gee gcg 
Arg Ala Ala 



aga gag tgg 
Arg Glu Trp 



cag ccc agg 
Gin Pro Arg 
140 

cag agt gtg 
Gin Ser Val 
155 

gat gac etc 
Asp Asp Leu 
170 

ttt ggg aac 
Phe Gly Asn 



aac gee atg 
Asn Ala Met 



gee age egg 
Ala Ser Arg 
220 

tec caa gcg 
Ser Gin Ala 
235 



aat cct cag 
Asn Pro Gin 
30 

tgc teg ggg 
Cys Ser Gly 
45 

tct gtt act 
Ser Val Thr 



get ggt ggg 
Ala Gly Gly 



tac gat cct 
Tyr Asp Pro 
95 

gec etc ttt 
Ala Leu Phe 
110 

tct ctg gag 
Ser Leu Glu 
125 

acg ctg ccg 
Thr Leu Pro 



acc gec tec 
Thr Ala Ser 



ggc tec tat 
Gly Ser Tyr 
175 

acg cca ccg 
Thr Pro Pro 
190 

tec tec ctg 
Ser Ser Leu 
205 

ttt gee teg 
Phe Ala Ser 



agt cag aag 
Ser Gin Lys 



tgg gtc agt 
Trp Val Ser 



aga cac cgc 
Arg His Arg 
50 

atg gac aag 
Met Asp Lys 
65 

aat get aag 
Asn Ala Lys 
80 

tgc tgg tec 
Cys Trp Ser 



agg gat aag 
Arg Asp Lys 



tea tea cct 
Ser Ser Pro 
130 

tec atg gtg 
Ser Met Val 
145 

teg gac aag 
Ser Asp Lys 
160 

caa ggg gec 
Gin Gly Ala 



cct cag aag 
Pro Gin Lys 



tac teg ggc 
Tyr Ser Gly 
210 

gca gee aag 
Ala Ala Lys 
225 

gcg tec gag 
Ala Ser Glu 
240 



gtg acc tac 
Val Thr Tyr 
35 

ggg ctt ggg 
Gly Leu Gly 



tgg aag gac 
Trp Lys Asp 



ttc cga gag 
Phe Arg Glu 
85 

ttg cag gag 
Leu Gin Glu 
100 

gtg gtc get 
Val Val Ala 
115 

gec cag aac 
Ala Gin Asn 



cac cga gtc 
His Arg Val 



get ttt gaa 
Ala Phe Glu 
165 

cag ggg aat 
Gin Gly Asn 
180 

aaa gaa gat 
Lys Glu Asp 
195 

tgg age age 
Trp Ser Ser 



gag ggc get 
Glu Gly Ala 



ctg ggc cac 
Leu Gly His 
245 



ggc 15 0 

Gly 



gtt 198 
Val 



att 246 
He 
70 

ttc 294 
Phe 



aag 342 
Lys 



ctg 390 
Leu 



tgg 438 
Trp 



tct 486 

Ser 

150 

gac 534 
Asp 



cgc 582 
Arg 



gac 63 0 

Asp 



ttc 678 
Phe 



aca 726 

Thr 

230 

age 774 
Ser 
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ctg aac gag aac gtc etc aag cct gcg cag gag aag gtg aag gag gga 822 
Leu Asn Glu Asn Val Leu Lys Pro Ala Gin Glu Lys Val Lys Glu Gly 
250 255 260 

aag att ttt gat gat gtc tec agt ggg gtc tct cag ttg gcg tec aag 870 
Lys lie Phe Asp Asp Val Ser Ser Gly Val Ser Gin Leu Ala Ser Lys 
265 270 275 

gtc cag gga gtc ggt agt aag gga tgg egg gac gtc acc acc ttt ttt 918 
Val Gin Gly Val Gly Ser Lys Gly Trp Arg Asp Val Thr Thr Phe Phe 
280 285 290 

tc 9 999 aaa 9 CSi 9&9 99 c ccc ttg gac age ccc teg gag ggc cac agt 966 
Ser Gly Lys Ala Glu Gly Pro Leu Asp Ser Pro Ser Glu Gly His Ser 
295 300 305 310 

tat cag aac age ggt ctg gac cac ttc caa aac age aac ata gac cag 1014 
Tyr Gin Asn Ser Gly Leu Asp His Phe Gin Asn Ser Asn lie Asp Gin 
315 320 325 

age ttc tgg gag acc ttt gga agt get gag ccc acc aag acc cgc aag 1062 
Ser Phe Trp Glu Thr Phe Gly Ser Ala Glu Pro Thr Lys Thr Arg Lys 
330 335 340 

tec ccg age age gac age tgg acg tgc gcg gac acc tec acc gag agg 1110 
Ser Pro Ser Ser Asp Ser Trp Thr Cys Ala Asp Thr Ser Thr Glu Arg 
345 350 355 

a 99 a 9 c teg gac age tgg gag gtg tgg ggc teg gee tec acc aac agg 115 8 

Arg Ser Ser Asp Ser Trp Glu Val Trp Gly Ser Ala Ser Thr Asn Arg 
360 365 370 

aac age aac age gac ggc ggg gag ggc ggg gag ggc acc aag aag gca 12 0 6 

Asn Ser Asn Ser Asp Gly Gly Glu Gly Gly Glu Gly Thr Lys Lys Ala 
375 380 385 390 

gtg ccg ccg gee gtg ccc act gat gat ggc tgg gac aac cag aac tgg 1254 
Val Pro Pro Ala Val Pro Thr Asp Asp Gly Trp Asp Asn Gin Asn Trp 
395 400 405 

tag ggee cactgcgccc ccgtccccag cgcccccggg egacttegtg tttgeactet 1311 



gccctcgtcg ttcctcctcc ttccatttga cccaagaatc ageaactgea gtgtgaggac 1371 

agegtctegg gaggcaggac cctagggaga cccgggtgtg cgccgcctgc gcgtggggag 1431 

tcttcggtgc gtgggggcgg cttgctgtcc agcctgtgtg ggggccgtcc cgtcccacac 1491 

tcccctgggc attcttggac teaaggcegg ggctctgcgt ggcttgctgg gaggtgggct 1551 

gcagcacaga ggcctgtgac tgcgttccag cggccagttc actaegcagt atctctgggg 1611 

cctgggacca gccacgtgcc gagctgtcag egaegtgagg tgtcccttct cgttgagata 1671 
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tttaactttg 


gttttgctct 


agttctttct 


ttttgaagag 


agtgactgga 


gtggtaaaga 


1731 


tggaaatgct 


ggaaatgata 


ctggcgctca 


cgctgccatc 


cgaccaccct 


cggctcccga 


1791 


gtccacgcct 


gcctgggcct 


gtgctgtcag 


acccgcgtcg 


gtcgtaaccc 


tctgtggctc 


1851 


ccctgcatca 


gcaccgtccc 


accaccaagt 


tcaccaggtt 


caccagacac 


ggcctccaca 


1911 


atagccacac 


ccacacctga 


gctgttctca 


gtgctggaac 


ttgaccatcc 


tggaacaccc 


1971 


tggaagaaaa 


aggagcgcag 


ggtgggccct 


cggccctgat 


gcaggagggt 


gcgatagcgg 


2031 


acgtggccag 


gcaggagggg 


ccgggttcag 


gagctgagca 


ggggatgcct 


gtgcgtggtg 


2091 


cctgggtcta 


gggaagctcc 


agccccagga 


tggggctgcc 


ctgcacaccg 


gtgcccgcca 


2151 


catgccaacc 


ctcacctccc 


cgaggactgg 


atgatgtgct 


gccacgtgtg 


actcgtctcc 


2211 


cttgtctgcc 


ctgtgtgacc 


ctcagtcttg 


gccagccatg 


catgcgcccg 


aagctcgtgc 


2271 


agtttgtacg 


tgaggtgctc 


tcctccctgc 


caccatgctc 


atcactctgg 


ccttggccat 


2331 


gctccctggt 


caccccactt 


cccggtcgcc 


gtctgcagca 


ctcctggagc 


agcctgggcc 


2391 


cttcagcccc 


tgtgctcgtc 


ccaccctagg 


gactcagcca 


cttgcagaac 


aggatgggac 


2451 


cgagatttca 


gcgagccctc 


ctggcgcccg 


gtcctccctg 


tgggcaccag 


ccctcttggt 


2511 


agctggtgtg 


gagggccggt 


gtccttggct 


gccacggagg 


gatttgatca 


ccgaagcagc 


2571 


cacctgctgt 


agttggacct 


gaggtcagag gcggggcatc 


agaggctcaa 


ggtgctgaga 


2631 


agccaccggg 


aaagcagcca 


gcacaaaggg 


cccaggaagc 


cagcccccga 


gagctgagcg 


2691 


tgggggtctt 


tgagtgtctt 


tctccaagct 


gagacgtggg 


cggccgcgtg 


gtatctcccg 


2751 


agggctgctt 


ggaccctggt 


gggctgagtg 


ctccgaggag 


gggtggactc 


caccttggac 


2811 


agtgggatgt 


ggtgttccac 


atgtgcctgt 


ttccacgcca 


gcaccttgac 


ttggcagcat 


2871 


ggagccaagg 


tctgtccccg 


cccaggaggg 


tgccttcctc 


gggggtaggg 


ggacggccca 


2931 


ctctgcccca 


gggagtccct 


tttgatggga 


agtgcagtca 


gcagcgtgga 


ggtgtctggg 


2991 


ccaccttcag 


aaggtggatg 


tggtggccga 


gaccccgtcc 


acggagggtg 


atggcctttc 


3051 


ccttctgcag 


gtgcgggcag 


gtgggcctgg 


gaccggtgct 


ggggcctctc 


cttgctgtgt 


3111 


gtgagggccc 


aggtggaagg 


cgcggacctg 


acagcattcc 


aataaagcat 


acgggaacat 


3171 


gcaaaaaaaa 


aaaa 










3185 



<210> 27 
<211> 1850 
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<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 

<222> (596) . . (1840) 
<400> 27 

tttttgtatg caccacgggc ggcggtggtc ggtgcgggag gagggagggg agcttgcggg 60 

cccgagaggg ggcgacggcg gcggcggtgg cctgaggagg cccgagcggc ggcggtggcg 12 0 

gcgaaggccg aggcgtctag gtgtttttgg aagagctgca gccctcttct cacagatgag 18 0 

ctacgaggag atgatgacac tgactgagca gcacctggag tctcagaacg tcaccaaagg 240 

tgcccgccac aagatagccc tgagcatcca gaagctgcgt gagagacaga gcgtcctcaa 300 

gtccctagag aaggatgtgc tggaaggcgg gaacctacga aacgctctgc aggagctgca 360 

gcagatcatc atcactccca tcaaggccta cagtgtcctc caggccaccg tggctgccgc 420 

caccaccacc cctactgcca aggatggggc cccgggggaa ccaccgctgc caggtgctga 480 

gcctccccta gcccaccccg gcacagacaa aggcaccgag gccagagccg ggaccatgtg 540 

acggcgctgg ccctcgccac cgccgtcccc cgaccctggc cccaggcccg gcacc atg 598 

Met 
1 

atg ttc cga gac cag gtg ggc ate etc get ggc tgg ttc aaa ggc tgg 646 
Met Phe Arg Asp Gin Val Gly lie Leu Ala Gly Trp Phe Lys Gly Trp 
5 10 15 

aat gag tgt gag cag aca gtg gec etc ctg tea ctt ccg aaa egg gtc 694 
Asn Glu Cys Glu Gin Thr Val Ala Leu Leu Ser Leu Pro Lys Arg Val 
20 25 30 

ace cgt ace cag gec cgc ttc ctg cag etc tgc ctg gag cac tea ctg 742 
Thr Arg Thr Gin Ala Arg Phe Leu Gin Leu Cys Leu Glu His Ser Leu 
35 40 45 

gcg gac tgc aat gac ate cac ctg ctg gag teg gag gec aac agt get 790 
Ala Asp Cys Asn Asp lie His Leu Leu Glu Ser Glu Ala Asn Ser Ala 
50 55 60 65 

gec ate gtc age cag tgg cag cag gag tec aaa gag aag gtg gtg tec 83 8 

Ala lie Val Ser Gin Trp Gin Gin Glu Ser Lys Glu Lys Val Val Ser 
70 75 80 

etc ctg ctg tec cac ctt ccc ctg ctt cag cca ggc aac aca gag gee 886 
Leu Leu Leu Ser His Leu Pro Leu Leu Gin Pro Gly Asn Thr Glu Ala 
85 90 95 

aag teg gag tac atg agg eta ctg cag aaa gtg ctg gee tac tea ate 934 
Lys Ser Glu Tyr Met Arg Leu Leu Gin Lys Val Leu Ala Tyr Ser lie 
100 105 110 
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gag age aat get ttc ate gag gag agt cgc cag ctg ctt tec tat gee 982 
Glu Ser Asn Ala Phe lie Glu Glu Ser Arg Gin Leu Leu Ser Tyr Ala 
115 120 125 

etc ate cac cca gec ace aca ctg gag gac cgc aac gca ctg gec etc 1030 
Leu lie His Pro Ala Thr Thr Leu Glu Asp Arg Asn Ala Leu Ala Leu 
130 135 140 145 

tgg ctg age cac ctg gaa gag egg ttg get agt ggc ttc cgc tec egg 1078 
Trp Leu Ser His Leu Glu Glu Arg Leu Ala Ser Gly Phe Arg Ser Arg 
150 155 160 

cca gag ccc tec tac cat tea cgt caa ggc tea gat gag tgg ggg ggc 1126 
Pro Glu Pro Ser Tyr His Ser Arg Gin Gly Ser Asp Glu Trp Gly Gly 
165 170 175 

cct gca gag eta ggc cct ggg gag gca ggg cca ggc tgg cag gac aag 1174 
Pro Ala Glu Leu Gly Pro Gly Glu Ala Gly Pro Gly Trp Gin Asp Lys 
180 185 190 

cca ccc egg gaa aat gga cac gtg ccc ttc cac cca tec age tea gtg 1222 
Pro Pro Arg Glu Asn Gly His Val Pro Phe His Pro Ser Ser Ser Val 
195 200 205 

ccg cca gec ate aac agt att ggg age aat gca aac aca ggt etc ccc 1270 
Pro Pro Ala lie Asn Ser lie Gly Ser Asn Ala Asn Thr Gly Leu Pro 
210 215 220 225 

tgc caa ate cac cct age cca ctg aag cgc tec atg tea etc ate cct 1318 
Cys Gin lie His Pro Ser Pro Leu Lys Arg Ser Met Ser Leu lie Pro 
230 235 240 

aca age ccc cag gtc cct ggt gag tgg ccg agt cca gag gag ctt ggg 13 66 

Thr Ser Pro Gin Val Pro Gly Glu Trp Pro Ser Pro Glu Glu Leu Gly 
245 250 255 

gec egg get get ttt acc acg ccc gat cac gca cct etc teg ccc cag 1414 
Ala Arg Ala Ala Phe Thr Thr Pro Asp His Ala Pro Leu Ser Pro Gin 
260 265 270 

age age gtg gec tec tct ggc agt gag cag aca gag gag cag ggc tec 1462 
Ser Ser Val Ala Ser Ser Gly Ser Glu Gin Thr Glu Glu Gin Gly Ser 
275 280 285 

age egg aac acc ttc cag gag gat ggc agt ggc atg aaa gat gtg ccc 1510 
Ser Arg Asn Thr Phe Gin Glu Asp Gly Ser Gly Met Lys Asp Val Pro 
290 295 300 305 

tea tgg etc aag age etc cgt ttg cac aag tat gca gee etc ttc tea 1558 
Ser Trp Leu Lys Ser Leu Arg Leu His Lys Tyr Ala Ala Leu Phe Ser 
310 315 320 

cag atg age tac gag gag atg atg aca ctg act gag cag cac ctg gag 1606 
Gin Met Ser Tyr Glu Glu Met Met Thr Leu Thr Glu Gin His Leu Glu 
325 330 335 
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tct cag aac gtc acc aaa ggt gcc cgc cac aag ata gcc ctg age ate 
Ser Gin Asn Val Thr Lys Gly Ala Arg His Lys lie Ala Leu Ser lie 
340 345 350 



1654 



cag aag ctg cgt gag aga cag age gtc etc aag tec eta gag aag gat 17 02 

Gin Lys Leu Arg Glu Arg Gin Ser Val Leu Lys Ser Leu Glu Lys Asp 
355 360 365 

gtg ctg gaa ggc ggg aac eta cga aac get ctg cag gag ctg cag cag 1750 
Val Leu Glu Gly Gly Asn Leu Arg Asn Ala Leu Gin Glu Leu Gin Gin 
370 375 380 385 

ate ate ate act ccc ate aag gcc tac agt gtc etc cag gcc acc gtg 1798 
lie lie He Thr Pro He Lys Ala Tyr Ser Val Leu Gin Ala Thr Val 
390 395 400 

get gcc gcc acc ctg tat tgc ggc cgc tct aga gga tec tag ettegta 1847 
Ala Ala Ala Thr Leu Tyr Cys Gly Arg Ser Arg Gly Ser * 
405 410 415 

cgc 1850 



<210> 28 

<211> 4781 

<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 

<222> (813) . , (3635) 
<400> 28 

tttgctgtgg aattcccggg tcgacccacg cgtccgcgga cgcgtgggtt tagaggtaag 60 

tttgectact ttgtcgtcta gtgggtaaaa ttttgeggag agcgttggat ctgggaagcg 12 0 

ggatagggat ggatgggttc atttgagagc caeggcttaa agcggttgcg atcaggatgg 180 

gacacaggtt tgtttgggga caacaaagat ggcatttgtg agtgttttga agcaacccgt 240 

actgattaca tctttctccc ttgtgttcct tttatcccag gtttgaattt teteggagaa 300 

agacaggecg gecacgagga aaacagaaac aagccgcagc aacatctaag cccttgaaag 360 

gatcctgaga gaggggggaa agggaaaaca gcagccacca gcccaaccac ttgtgtcttc 42 0 

tgccccttcc cacctatctt gcccacccca ccagcccacg ctgcttggga cttgaaatct 480 

gtggccgaag gaccgtcact acataacttc aaaaataatc aaccaccctc ccttcccaaa 540 

ccacccaaat tcactcatcc agegtttact tttttgaatc cacteagaac ttttttctgc 600 

gacccccctc cctaaatgga gttgggtggg ggggaaatga atactgagtt ggectttatt 66 0 
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J 



ttttaaaaga ctttttgatc caatgaggcc ccctaaataa ttgagttttg ggtcctggtt 720 

ggttgtttta ttttttttcc tccaaaattt taccccctcc cccctgagcc cgaggtgctg 780 

acgtcgcaaa aaaattggat aaaaccacca tc atg ggt teg ggt ccc ata gac 833 

Met Gly Ser Gly Pro lie Asp 
1 5 

ccc aaa gaa ctt etc aag ggc ctg gac age ttc ctt aac cga gat ggg 881 
Pro Lys Glu Leu Leu Lys Gly Leu Asp Ser Phe Leu Asn Arg Asp Gly 
10 15 20 

gaa gtc aaa agt gtg gat ggg att tec aag ate ttc agt ttg atg aag 92 9 

Glu Val Lys Ser Val Asp Gly He Ser Lys He Phe Ser Leu Met Lys 
25 30 35 

gaa gca cga aag atg gtg agt cga tgc act tac ttg aac att etc ctg 977 
Glu Ala Arg Lys Met Val Ser Arg Cys Thr Tyr Leu Asn He Leu Leu 
40 45 50 55 

cag acc cgt tea cca gaa ata ttg gtc aaa ttt att gac gtt ggc ggc 1025 
Gin Thr Arg Ser Pro Glu He Leu Val Lys Phe He Asp Val Gly Gly 
60 65 70 

tac aaa ctt ctt aac aat tgg ctg acg tat tea aag aca acc aac aac 1073 
Tyr Lys Leu Leu Asn Asn Trp Leu Thr Tyr Ser Lys Thr Thr Asn Asn 
75 80 85 

att ccc etc etc cag caa att eta ctg acc ctg cag cat eta ccg etc 1121 
He Pro Leu Leu Gin Gin He Leu Leu Thr Leu Gin His Leu Pro Leu 
90 95 100 

act gta gac cat etc aag cag aac aac aca get aaa ctg gtg aag cag 1169 
Thr Val Asp His Leu Lys Gin Asn Asn Thr Ala Lys Leu Val Lys Gin 
105 110 115 

ctg age aag tea agt gag gat gaa gag etc egg aaa ttg gee tea gtc 1217 
Leu Ser Lys Ser Ser Glu Asp Glu Glu Leu Arg Lys Leu Ala Ser Val 
120 125 130 135 

ctt gtc age gac tgg atg get gtc ate cgc tct cag age agt acc cag 12 65 

Leu Val Ser Asp Trp Met Ala Val He Arg Ser Gin Ser Ser Thr Gin 
140 145 150 

cct get gag aaa gat aag aag aaa cgt aaa gat gaa gga aaa agt cga 1313 
Pro Ala Glu Lys Asp Lys Lys Lys Arg Lys Asp Glu Gly Lys Ser Arg 
155 160 165 

acc acc ctt cct gag cga cct ttg aca gag gtg aag get gag acc egg 13 61 

Thr Thr Leu Pro Glu Arg Pro Leu Thr Glu Val Lys Ala Glu Thr Arg 
170 175 180 

get gag gag gee cca gag aag aag agg gag aag ccc aag tct ctt cgc 14 09 

Ala Glu Glu Ala Pro Glu Lys Lys Arg Glu Lys Pro Lys Ser Leu Arg 
185 190 195 

acc aca gca ccc agt cat gee aag ttc cgt tec act gga eta gag ctg 1457 



88 



Thr Thr Ala Pro Ser His Ala Lys Phe Arg Ser Thr Gly Leu Glu Leu 
200 205 210 215 



gag aca cca tec ttg gtg cct gtg aag aag aat gec age aca gtg gtg 1505 
Glu Thr Pro Ser Leu Val Pro Val Lys Lys Asn Ala Ser Thr Val Val 
220 225 230 

gtt tct gac aag tac aac ctt aaa ccc ate ccc etc aaa cgt cag age 1553 
Val Ser Asp Lys Tyr Asn Leu Lys Pro lie Pro Leu Lys Arg Gin Ser 
235 240 245 

aac gta get get cca gga gat gee act ccc cct gca gag aag aaa tac 1601 
Asn Val Ala Ala Pro Gly Asp Ala Thr Pro Pro Ala Glu Lys Lys Tyr 
250 255 260 

aag cca etc aac aca aca cct aat gee ace aaa gag ate aaa gtg aag 1649 
Lys Pro Leu Asn Thr Thr Pro Asn Ala Thr Lys Glu lie Lys Val Lys 
265 270 275 

ate ate ccg cca cag cct atg gag ggc ctg ggc ttt ctg gat get ctt 1697 
lie lie Pro Pro Gin Pro Met Glu Gly Leu Gly Phe Leu Asp Ala Leu 
280 285 290 295 

aat tea gee cct gtt cca ggc ate aaa att aag aag aaa aaa aaa gta 1745 
Asn Ser Ala Pro Val Pro Gly lie Lys lie Lys Lys Lys Lys Lys Val 
300 305 310 

ctg tea cct acg get gee aag cca age ccc ttt gaa ggg aaa acg age 1793 
Leu Ser Pro Thr Ala Ala Lys Pro Ser Pro Phe Glu Gly Lys Thr Ser 
315 320 325 

aca gaa cca age aca gee aaa cct tct tec cca gaa cca gca cca cct 1841 
Thr Glu Pro Ser Thr Ala Lys Pro Ser Ser Pro Glu Pro Ala Pro Pro 
330 335 340 

tct gag gca atg gac gca gac cgt cca ggc ace ccg gtt ccc cct gtt 1889 
Ser Glu Ala Met Asp Ala Asp Arg Pro Gly Thr Pro Val Pro Pro Val 
345 350 355 

gaa gtc ccg gag etc atg gat aca gee tct ttg gag cca gga get ctg 1937 
Glu Val Pro Glu Leu Met Asp Thr Ala Ser Leu Glu Pro Gly Ala Leu 
360 365 370 375 

gat gee aag cca gtg gag agt cct gga gat cct aac caa ctg ace egg 1985 
Asp Ala Lys Pro Val Glu Ser Pro Gly Asp Pro Asn Gin Leu Thr Arg 
380 385 390 

aaa ggc agg aag agg aaa agt gtg aca tgg cct gag gaa ggc aaa ctg 2 033 
Lys Gly Arg Lys Arg Lys Ser Val Thr Trp Pro Glu Glu Gly Lys Leu 
395 400 405 

aga gaa tat ttc tat ttt gaa ttg gat gaa act gaa cga gta aat gtg 2 081 

Arg Glu Tyr Phe Tyr Phe Glu Leu Asp Glu Thr Glu Arg Val Asn Val 
410 415 420 

aat aag ate aag gac ttt ggt gag gcg get aag cga gag at a ctg tea 212 9 

Asn Lys lie Lys Asp Phe Gly Glu Ala Ala Lys Arg Glu lie Leu Ser 
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425 430 435 

gac cga cat gca ttt gag aca gcg egg cgt ctg age cat gat aac atg 2177 
Asp Arg His Ala Phe Glu Thr Ala Arg Arg Leu Ser His Asp Asn Met 
440 445 450 455 

gag gag aag gtg ccc tgg gtg tgc ccc egg ccc ctg gtt ctg ccc tea 2225 
Glu Glu Lys Val Pro Trp Val Cys Pro Arg Pro Leu Val Leu Pro Ser 
460 465 470 

cct ctt gtc acc cct gga age aat agt cag gag cga tat ate cag get 2273 
Pro Leu Val Thr Pro Gly Ser Asn Ser Gin Glu Arg Tyr lie Gin Ala 
475 480 485 

gag egg gag aag gga ate ctt cag gag etc ttc ctg aac aag gag agt 2321 
Glu Arg Glu Lys Gly lie Leu Gin Glu Leu Phe Leu Asn Lys Glu Ser 
490 495 500 

cct cat gag cct gat cct gag ccc tac gag ccc ata ccc cct aaa etc 2369 
Pro His Glu Pro Asp Pro Glu Pro Tyr Glu Pro lie Pro Pro Lys Leu 
505 510 515 

ate ccc eta gat gag gag tgt tec atg gat gag act ccg tat gtt gag 2417 
lie Pro Leu Asp Glu Glu Cys Ser Met Asp Glu Thr Pro Tyr Val Glu 
520 525 530 535 

act ctg gaa cct ggg ggg tea ggt ggc tea cct gat ggg gca gga ggc 2465 
Thr Leu Glu Pro Gly Gly Ser Gly Gly Ser Pro Asp Gly Ala Gly Gly 
540 545 550 

tec aag ttg cct cca gtt ctg gec aat ctt atg gga age atg ggt get 2513 
Ser Lys Leu Pro Pro Val Leu Ala Asn Leu Met Gly Ser Met Gly Ala 
555 560 565 

gga aag ggc ccc caa ggc cct gga gga gga ggc att aat gtc caa gag 2 561 

Gly Lys Gly Pro Gin Gly Pro Gly Gly Gly Gly lie Asn Val Gin Glu 
570 575 580 

ate etc acc tec ate atg ggt age cca aac agt cat cct tea gag gaa 2609 
lie Leu Thr Ser lie Met Gly Ser Pro Asn Ser His Pro Ser Glu Glu 
585 590 595 

eta ctg aaa caa cca gac tat teg gac aag ate aag cag atg ctg gtg 2657 
Leu Leu Lys Gin Pro Asp Tyr Ser Asp Lys lie Lys Gin Met Leu Val 
600 605 610 615 

cca cat gga etc eta ggc cct ggc cca ata gee aat ggt ttc cca cca 2705 
Pro His Gly Leu Leu Gly Pro Gly Pro lie Ala Asn Gly Phe Pro Pro 
620 625 630 

999 99^ cct ggg ggc ccc aag ggc atg cag cac ttt ccc cct gga cct 2753 
Gly Gly Pro Gly Gly Pro Lys Gly Met Gin His Phe Pro Pro Gly Pro 
635 640 645 

999 99& cct atg cca ggt ccc cat gga ggc cct ggt ggg cca gtg ggt 2 801 

Gly Gly Pro Met Pro Gly Pro His Gly Gly Pro Gly Gly Pro Val Gly 
650 655 660 
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cca cgt ctt ctg ggt cct cca ccc cct ccc egg gga ggt gat ccc ttc 2849 
Pro Arg Leu Leu Gly Pro Pro Pro Pro Pro Arg Gly Gly Asp Pro Phe 
665 670 675 

tgg gat ggc ccg ggc gac cct atg egg ggt ggc cca atg egg ggg ggt 2 897 

Trp Asp Gly Pro Gly Asp Pro Met Arg Gly Gly Pro Met Arg Gly Gly 
680 685 690 695 

cca gga cca ggt cct gga cca tac cat aga ggc cga ggt ggc cga gga 2 945 
Pro Gly Pro Gly Pro Gly Pro Tyr His Arg Gly Arg Gly Gly Arg Gly 
700 705 710 

gga aac gaa cct cct cct cct cct cct cca ttc cga ggc gec aga gga 2993 
Gly Asn Glu Pro Pro Pro Pro Pro Pro Pro Phe Arg Gly Ala Arg Gly 
715 720 725 

ggt cgc tct gga gga gga ccc cca aat gga cga ggg ggc cct ggt ggg 3 041 
Gly Arg Ser Gly Gly Gly Pro Pro Asn Gly Arg Gly Gly Pro Gly Gly 
730 735 740 

ggc atg gtt gga ggt ggt ggg cat cgt cct cac gaa ggc cct ggt ggg 3 089 
Gly Met Val Gly Gly Gly Gly His Arg Pro His Glu Gly Pro Gly Gly 
745 750 755 

ggc atg ggc aac age agt gga cat cgt ccc cac gaa ggc cct ggc ggt 313 7 

Gly Met Gly Asn Ser Ser Gly His Arg Pro His Glu Gly Pro Gly Gly 
760 765 770 775 

ggc atg gga agt ggg cat cgc ccc cat gaa ggc cct ggt ggt age atg 3185 
Gly Met Gly Ser Gly His Arg Pro His Glu Gly Pro Gly Gly Ser Met 
780 785 790 

ggt ggg ggt gga gga cat cgt ccc cac gaa ggc cct ggc ggt ggc ate 32 3 3 

Gly Gly Gly Gly Gly His Arg Pro His Glu Gly Pro Gly Gly Gly He 
795 800 805 

a gt ggt gg c a gt ggc cat cgt ccc cat gaa ggc cct ggc gga gga atg 32 81 

Ser Gly Gly Ser Gly His Arg Pro His Glu Gly Pro Gly Gly Gly Met 
810 815 820 

ggt gee ggt ggt gga cat cgc ccc cac gaa ggc cct ggc gga age atg 3329 
Gly Ala Gly Gly Gly His Arg Pro His Glu Gly Pro Gly Gly Ser Met 
825 830 835 

ggt gga agt ggt gga cat cgt ccc cat gaa ggc cct gga cac ggg ggg 3 37 7 

Gly Gly Ser Gly Gly His Arg Pro His Glu Gly Pro Gly His Gly Gly 
840 845 850 855 

ccc cat ggc cac egg cct cat gat gtc cct ggt cac cga ggc cat gac 3425 
Pro His Gly His Arg Pro His Asp Val Pro Gly His Arg Gly His Asp 
860 865 870 

cat cga ggg ccg cca cct cat gag cac cgt ggc cat gat ggt cct ggc 3473 
His Arg Gly Pro Pro Pro His Glu His Arg Gly His Asp Gly Pro Gly 
875 880 885 
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cac ggg gga ggg ggc cac cga ggg cac gat gga ggc cac age cat gga 3521 
His Gly Gly Gly Gly His Arg Gly His Asp Gly Gly His Ser His Gly 
890 895 900 

gga gac atg tea aac cgc cct gtc tgc cga cat ttc atg atg aag ggc 3569 
Gly Asp Met Ser Asn Arg Pro Val Cys Arg His Phe Met Met Lys Gly 
905 910 915 

aac tgc cgc tat gag aac aac tgt gec ttc tac cac ccg ggt gtc aat 3 617 

Asn Cys Arg Tyr Glu Asn Asn Cys Ala Phe Tyr His Pro Gly Val Asn 
920 925 930 935 



ggg ccc ccc ctg ccc tag ggacca tttgcctgcc ctgttcacac aacccctgtg 3671 
Gly Pro Pro Leu Pro * 
940 



gactgcagcc 


tcgctctttc 


caccctgtta 


tggcttctgt 


gaggeccatt 


ttcccttttc 


3731 


cccagctgat 


gaggagcegg 


ccccctcagt 


tcccacttgc 


ttgggttcct 


gggggttttc 


3791 


tgatcactgg 


tgcgcattga 


tgtacatatt 


ttcctccagt 


ctggggagga 


gagagactgg 


3851 


aaacgttcct 


ggactgctga 


agaggagacc 


cagttggctt 


cactttttga 


gaagattege 


3911 


cctgtacccc 


aaaccccttt 


ccagtattac 


ecttaatget 


tgagaaccta 


aagctggtta 


3971 


tcctggcgaa 


cacccctacc 


ettctattge 


gggtccccac 


atgcacacag 


aactctgaca 


4031 


caggatcagc 


tgcacttaag 


aaatcatccc 


agctaagttc 


attattcctc 


atggggtggg 


4091 


gagatgetga 


aaggggtatt 


gtatatccca 


ctgeactgag 


agggctcaat 


cagctggatt 


4151 


tgagttctgg 


aacacacatc 


atccccaccc 


ctcccccagc 


gtgggctcac 


cattcttagt 


4211 


cctttctcaa 


gtgggacctt 


caactttctg 


tgaacaccca 


gtctgcgtcc 


tgggtctget 


4271 


aggttcgatg 


atggcgaact 


egtatctgea 


tccggtgcaa 


gttttagctg 


gcagaggtga 


4331 


gaccggtggt 


gctggtctgc 


ctttgccaac 


tatagecagt 


ctggagactt 


gataaaatac 


4391 


ttcagtgaga 


ccagcttctc 


atcaacttgg 


gcccggcgtg 


ctgggcctga 


aagtcacact 


4451 


acatgeactg 


cctttgggag 


tcagctcact 


ccctgctccc 


acctggaacc 


ttgccagcgt 


4511 


gaaggaggct 


tccaggtact 


tcaccctgtc 


aaccacctct 


gaatccccac 


caggcgcctt 


4571 


cctgggtgga 


ttcaacaaga 


tgattttgee 


ctttcccagt 


tctctccttc 


actttggcat 


4631 


cagttgtttt 


ctatgaaaac 


agtggattgg 


ttgggttttg 


tgeagggtet 


tgggttagag 


4691 


ccaaaatgga 


tttgaggatg 


agtatttttt 


tttttggttt 


tgtatatttt 


gtacattaat 


4751 


aataaacagt 


ggaaagagaa 


aaaaaaaaaa 








4781 
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<210> 29 

<211> 968 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> CDS 

<222> (184) . . (411) 



<220> 

<221> misc_feature 
<222> (1) . . . (968) 
<223> n = a, t , c or g 



<400> 29 

tttttttttt attattatac tttaagtttt agggtacatg tgcacaacgt gcaggtttgt 60 

tacatatgta tacatgtgcc atgttagtgt gctgcaccca ttaactcgtc atttagcatt 120 

aggtatatct cctacagcta tccctccccc cttcccccac cccacaaaag gtcccagtgt 180 

gtg atg ttc ccc ttc ctg tgt cca tgt gtt etc att gtt caa ttc cca 228 
Met Phe Pro Phe Leu Cys Pro Cys Val Leu lie Val Gin Phe Pro 
15 10 15 

cct atg agt gag aac atg tgg gtt tgg ttt ttt tgt cct tgc gat agt 2 76 

Pro Met Ser Glu Asn Met Trp Val Trp Phe Phe Cys Pro Cys Asp Ser 
20 25 30 

ttg ctg aga atg atg gtt tec age ttc ate cat gtc cct gcg aag gac 324 
Leu Leu Arg Met Met Val Ser Ser Phe lie His Val Pro Ala Lys Asp 
35 40 45 

atg aac tea ccc ttt ttt atg gaa tac tac aca gec ata aaa agg aat 372 
Met Asn Ser Pro Phe Phe Met Glu Tyr Tyr Thr Ala lie Lys Arg Asn 
50 55 60 

gac aac aca tec ctt gca ggg aca tgg atg gag caa tag gccattatcc 421 
Asp Asn Thr Ser Leu Ala Gly Thr Trp Met Glu Gin * 



481 
541 
601 
661 
721 
781 
841 
901 



65 




70 




75 




gtagcaaact 


aatgcaggaa 


gagaaaaccc 


agtactgeat 


gttctcactt 


ataagtggga 


gctaaatgat 


gagaacacat 


gaacacaaag 


aggggaacag 


acactagggc 


cgtttagaag 


ttggcgggtg 


gtttgctttt 


ttntttagnt 


acangattta 


ttagnaatgg 


gtactaggct 


gaataccgtt 


gtggatggta 


gtaatcgggt 


gaacaaagcg 


cccatgttca 


caattttagc 


ttattttttc 


tggttttcgt 


catggtaccc 


cttgtgagtg 


tgagttatat 


gtttggttga 


tatttategg 


ttattggatt 


tggtccctgg 


gtggtgtttt 


cgtggagtgg 


tggggttgtt 


gtgattcttg 


tgtttggctg 


tggtgtttaa 


aagaggtgta 


ggttggtgac 


ccgggtgttg 


tecaegttat 


tggctttgat 


gaggccccct 


tttgtgggtg 


gtgtttggtg 


ttgttttggt 
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ctggtcggtg gccctatttg tgctttttgt tggtagtggc ttcctggtgg tcggtgagcg 961 
gtgtgtt 968 



<210> 30 

<211> 1082 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> CDS 

<222> (168) . . (725) 

<400> 30 

ttttgaaaaa ctgggatagc agtacccagc tggctagcgt ttaaacttaa gcttggtacc 60 

gagctcggat ccactagtcc agtgtggtgg aattccggcc aagtagagct ccgtcctgac 12 0 

gcgccgcctc ccgtgggctc cggccggcta agccgcggcg gacaact atg ctg aaa 176 

Met Leu Lys 
1 

gcc aag ate etc ttc gtg ggg cct tgc gag agt gga aaa act gtt ttg 224 
Ala Lys lie Leu Phe Val Gly Pro Cys Glu Ser Gly Lys Thr Val Leu 
5 10 15 

gcc aac ttt ctg aca gaa tct tct gac ate act gaa tac age cca ace 272 
Ala Asn Phe Leu Thr Glu Ser Ser Asp lie Thr Glu Tyr Ser Pro Thr 
20 25 30 35 

caa gga gtg agg ate eta gaa ttt gag aac ccg cat gtt acc age aac 32 0 

Gin Gly Val Arg lie Leu Glu Phe Glu Asn Pro His Val Thr Ser Asn 
40 45 50 

aac aaa ggc acg ggc tgt gaa ttc gag eta tgg gac tgt ggt ggc gat 3 68 

Asn Lys Gly Thr Gly Cys Glu Phe Glu Leu Trp Asp Cys Gly Gly Asp 
55 60 65 

get aag ttt gag tec tgc tgg ccg gcc ctg atg aag gat get cat gga 416 
Ala Lys Phe Glu Ser Cys Trp Pro Ala Leu Met Lys Asp Ala His Gly 
70 75 80 

gtg gtg ate gtc ttc aat get gac ate cca age cac egg aag gaa atg 464 
Val Val lie Val Phe Asn Ala Asp lie Pro Ser His Arg Lys Glu Met 
85 90 95 

gag atg tgg tat tec tgc ttt gtc caa cag ccg tec tta cag gac aca 512 
Glu Met Trp Tyr Ser Cys Phe Val Gin Gin Pro Ser Leu Gin Asp Thr 
100 105 110 115 

cag tgt atg eta att gca cac cac aaa cca ggc tct gga gat gat aaa 560 
Gin Cys Met Leu lie Ala His His Lys Pro Gly Ser Gly Asp Asp Lys 
120 125 130 
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gga age ctg tct ttg teg cca ccc ttg aac aag ctg aag ctg gtg cac 
Gly Ser Leu Ser Leu Ser Pro Pro Leu Asn Lys Leu Lys Leu Val His 
135 140 145 



608 



agtttaatca cgtgaaaaaa aaaaa 



tea aac ctg gaa gat gac cct gag gag ate egg atg gaa ttc ata aag 656 
Ser Asn Leu Glu Asp Asp Pro Glu Glu He Arg Met Glu Phe He Lys 
150 155 1^0 

tat tta aaa age ata ate aac tec atg tct gag age aga gac agg gag 704 
Tyr Leu Lys Ser He He Asn Ser Met Ser Glu Ser Arg Asp Arg Glu 
165 170 175 

gag atg tea att atg ace tag cc agccttcacc tgggactgee acatccccag 757 
Glu Met Ser He Met Thr * 
180 185 

tgaaatcagc atgtttctcg gtgeagatet gaaatcacat ccagctcctg atgttttctt 817 
ctccctctga ctgcagagga agtgttccta ectgeaggaa ggcacctgtc acacagggcg 877 
ttcactcaga ccatctgtgc tctgccctga gttcagttga gaaaatccta ttatcaaatt 
tggatttcct ggccccagaa cttcccaaag acctgtaaaa tggagggatt taccacctca 
catatgtcca gttaaacagt ttgtggactt gtaacegteg cagcccaatg atacaacagt 1057 



937 
997 



1082 



<210> 31 

<211> 1517 

<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 

<222> (666) . . (1406) 
<220> 

<221> misc_f eature 
<222> (1) . . . (1517) 
<223> n = a,t,c or g 

<400> 31 

ctcgaaatcg atactttgee ggaceggnne ggnnnncegg ggtcgacggg aggcaggagg 
gccgacccag gggtgctggc cgccctctgt gagaaaactg acaatgacat ccgggcctgc 
atcaacaccc tgcagttcct gtacagcegg ggccagcggg agetgagegt gcgggacgtg 
caggccacac gcgtgggcct caaggaccag egcagaggge tettcteggt gtggcaggag 
gtcttccagc tgcctcgagc ccagagcacc ccacctgcag gcgccgtgtg ggccaggacc 
ccgccctgcc tgctgacaca ctcctgctgg gtgacgggga cgcgggctcc ctcacctccg 
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cctcacagcg attctaccgt gtcctgcatg ccgctgcctc tgcgggcgag cacgagaagg 42 0 
tggtccaggg cttgtttgac aacttcctgc gtctgcggct gcgagactcc agcctgggtg 



600 
660 



480 

ctgtgtgtgt ggccctcgac tggctggcct tcgatgacct gctggcgggg gctgctcatc 540 

acagccagag cttccagctg ctgcgctacc cacccttcct gcccgtggcc ttccatgtgc 

tgtttgcttc cagccacaca cccaggatca ccttccccag cagccagcag gaggcccaga 

accgg atg age cag atg agg aac ctg ate cag acg ctg gtg tec ggc 707 
Met Ser Gin Met Arg Asn Leu He Gin Thr Leu Val Ser Gly 
15 10 

ate gcg cca gee acg cgc age egg gec acg ccc cag gee ctg etc etc 755 
He Ala Pro Ala Thr Arg Ser Arg Ala Thr Pro Gin Ala Leu Leu Leu 
15 20 25 30 

gat gec etc tgc ctg etc ctg gac att ctt gcg ccc aag etc cgc ccc 803 
Asp Ala Leu Cys Leu Leu Leu Asp He Leu Ala Pro Lys Leu Arg Pro 
35 40 45 

gtg age aca cag ctg tac age acc cgt gaa aag caa cag ctg gec age 851 
Val Ser Thr Gin Leu Tyr Ser Thr Arg Glu Lys Gin Gin Leu Ala Ser 
50 55 60 

ctg gtg ggc acg atg etc get tac age ctg acc tac cgc cag gag cgc 899 
Leu Val Gly Thr Met Leu Ala Tyr Ser Leu Thr Tyr Arg Gin Glu Arg 
65 70 75 

acg ccc gat ggc cag tac ate tac agg ctg gag ccg aac gtg gag gaa 
Thr Pro Asp Gly Gin Tyr He Tyr Arg Leu Glu Pro Asn Val Glu Glu 
80 85 90 

etc tgc cgc ttc cct gag ctg cct gec cgc aag ccc etc acc tac cag 
Leu Cys Arg Phe Pro Glu Leu Pro Ala Arg Lys Pro Leu Thr Tyr Gin 
95 100 105 HO 

acg aag cag etc ate gec cgc gag ate gag gtg gag aag atg egg egg 1043 
Thr Lys Gin Leu He Ala Arg Glu He Glu Val Glu Lys Met Arg Arg 
115 120 125 

gcg gag get tct gee egg gta gag aac age ccc cag gtg gat ggg age 1091 
Ala Glu Ala Ser Ala Arg Val Glu Asn Ser Pro Gin Val Asp Gly Ser 
130 135 140 

ccc cca ggg etc gag ggt ctg ctg ggg ggc att ggg gag aaa ggg gtg 113 9 

Pro Pro Gly Leu Glu Gly Leu Leu Gly Gly He Gly Glu Lys Gly Val 
145 150 155 



947 



995 



1187 



cac cga cct gec cca cgc aac cat gag cag egg ctg gag cac ate atg 
His Arg Pro Ala Pro Arg Asn His Glu Gin Arg Leu Glu His He Met 
160 165 170 

agg cga gcg gee egg gag gaa cag cct gag aag gac ttc ttt gga cgt 12 3 5 

Arg Arg Ala Ala Arg Glu Glu Gin Pro Glu Lys Asp Phe Phe Gly Arg 
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175 180 185 190 

gtg gtc gtc agg age aca gca gtc ccg agt gca ggg gac acg gec ccg 1283 
Val Val Val Arg Ser Thr Ala Val Pro Ser Ala Gly Asp Thr Ala Pro 
195 200 205 

gag cag gac tea gtg gag egg cgc atg ggc aca gcg gtg ggc agg age 1331 
Glu Gin Asp Ser Val Glu Arg Arg Met Gly Thr Ala Val Gly Arg Ser 
210 215 220 

gag gtc tgg ttc cgc ttc aac gag ggt gtc tec aac gec gtg egg cgc 13 79 

Glu Val Trp Phe Arg Phe Asn Glu Gly Val Ser Asn Ala Val Arg Arg 
225 230 235 

age ctg tac ate agg gac ttg etc tag ttctc tgagecgegg acatgccctc 1431 
Ser Leu Tyr lie Arg Asp Leu Leu * 
240 245 

geattgette ccgcagagtg cagagacagg aagctggaga tgtctttata aagtcacacc 1491 

tttacagact gtaaaaaaaa aaaaaa 1517 



<210> 32 

<211> 618 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> CDS 

<222> (498) . . (572) 



60 
120 
180 
240 
300 
360 
420 
480 
530 

Met Pro Val Val Pro Ala Thr Gin Glu Thr Glu 
15 10 

gca ggg gaa ttg cct gaa cct ggg aga cag agg ttg caa tga gecaaga 579 
Ala Gly Glu Leu Pro Glu Pro Gly Arg Gin Arg Leu Gin * 



<400> 
t tttgcctaa 


32 

gatacaatga 


taagtaaaaa 


catgttacag 


agtaataacg 


tgttgcatag 


gacaatactg 


taaatgtggt 


tcattccatc 


aacagacact 


gagccctgcg 


acgtgcctgg 


ctctattcta 


catctgaggg 


acacaaggtg 


aacaagacca 


ggccactgta 


ttcaacatct 


acatttaatg 


gaaatttttg 


aaagaagact 


tgagagatta 


taacagtggt 


cttcagcatc 


agggagtagg 


cctagaagaa 


gaggaggtca 


agaagtggct 


tttcttatta 


tcttcttaac 


tcttcaaatt 


tttactatga 


gcaattatta 


ttttttatta 


aaattttagg 


ccaggcttat 


ggctgtaatc 


ctagcagttt 


gggaggccaa 


ggtgagcgga 


tcacttgagg 


ttgggagttc 


gagaccagcc 


tgaccaacat 


ggagaaactc 


tgtctctact 


taaaaaaaaa 


tacaaaatta 


gecaggcatg 


gtggcac atg cct gta 


gtc cca get act cag gag act gag 
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15 20 25 

tcacgcctcg tgccgaattc ttggcctcga gggccaaat 618 



<210> 33 

<211> 1188 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> CDS 

<222> (313) . . (849) 

<400> 33 

aaggatcctt aattaaatta atcccccccc cccggggaga aacgttctca ctcgctctct 60 

gctcgctgcg ggcgctcccc gccctctgct gccagaacct tggggatgtg cctagacccg 12 0 

gcgcagcaca cgtccgggcc aaccgcgagc agaacaaacc tttggcgggc ggccaggagg 180 

ctccctccca gccaccgccc ccctccagcg cctttttttc cccccataca atacaagatc 240 

ttccttcctc agttccctta aagcacagcc cagggaaacc tcctcacagt tttcatccag 3 00 

ccacgggcca gc atg tct ggg ggc aaa tac gta gac teg gag gga cat 348 
Met Ser Gly Gly Lys Tyr Val Asp Ser Glu Gly His 
15 10 

etc tac acc gtt ccc ate egg gaa cag ggc aac ate tac aag ccc aac 396 
Leu Tyr Thr Val Pro lie Arg Glu Gin Gly Asn lie Tyr Lys Pro Asn 
15 20 25 

aac aag gcc atg gca gac gag ctg age gag aag caa gtg tac gac gcg 444 
Asn Lys Ala Met Ala Asp Glu Leu Ser Glu Lys Gin Val Tyr Asp Ala 
30 35 40 

cac acc aag gag ate gac ctg gtc aac cgc gac cct aaa cac etc aac 492 
His Thr Lys Glu lie Asp Leu Val Asn Arg Asp Pro Lys His Leu Asn 
45 50 55 60 

gat gac gtg gtc aag att gac ttt gaa gat gtg att gca gaa cca gaa 54 0 

Asp Asp Val Val Lys lie Asp Phe Glu Asp Val lie Ala Glu Pro Glu 
65 70 75 

ggg aca cac agt ttt gac ggc att tgg aag gcc age ttc acc acc ttc 588 
Gly Thr His Ser Phe Asp Gly lie Trp Lys Ala Ser Phe Thr Thr Phe 
80 85 90 

act gtg acg aaa tac tgg ttt tac cgc ttg ctg tct gcc etc ttt ggc 63 6 

Thr Val Thr Lys Tyr Trp Phe Tyr Arg Leu Leu Ser Ala Leu Phe Gly 
95 100 105 

ate ccg atg gca etc ate tgg ggc att tac ttc gcc att etc tct ttc 684 
lie Pro Met Ala Leu lie Trp Gly lie Tyr Phe Ala lie Leu Ser Phe 
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110 115 120 

ctg cac ate tgg gca gtt gta cca tgc att aag age ttc ctg att gag 732 
Leu His lie Trp Ala Val Val Pro Cys lie Lys Ser Phe Leu lie Glu 
125 130 135 140 

att cag tgc ate age cgt gtc tat tec ate tac gtc cac acc gtc tgt 780 
lie Gin Cys He Ser Arg Val Tyr Ser He Tyr Val His Thr Val Cys 
145 150 155 

gac cca etc ttt gaa get gtt ggg aaa ata ttc age aat gtc cgc ate 82 8 

Asp Pro Leu Phe Glu Ala Val Gly Lys He Phe Ser Asn Val Arg He 
160 165 170 

aac ttg cag aaa gaa ata taa at gacatttcaa ggatagaagt atacctgatt 8 81 
As n Leu Gin Lys Glu He * 
175 

ttttttcctt ttaattttcc tggtgccaat ttcaagttcc aagttgctaa tacagcaaca 941 

atttatgaat tgaattatct tggttgaaaa taaaaagatc actttctcag ttttcataag 1001 

tattatgtct cttctgagct atttcatcta tttttggcag tctgaatttt taaaacccat 1061 

ttaaattttt ttccttacct ttttatttgc atgtggatca accatcgctt tattggctga 1121 

gatatgaaca tattgttgaa aggtaatttg agagaaatat gaagaactga ggaggaaaaa 1181 

aaaaaaa 1188 



<210> 34 

<211> 920 

<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 

<222> (515) . . (697) 
<400> 34 

taagcttgcg geegcaattt tttttttttt ttttttgtat tttttggtag agaegggatt 60 

tcactatgtt ggtcaggctg gtctcgaact cccgaccgca agtgatccac ccgccttggc 120 

ctcccaaagt gctgggatta caagcttgag ccactgcacc cagcctggaa agtatattta 180 

tgaaaggttt gcactccaca aaagcatctt tgctagggtg tcaaggaaga gatcactaaa 240 

ccaaccccaa cacatccata caattccagc aatctagaga gggctggtcc ttttcctttt 300 

ctggattatt ttctgttctc agtaaaacaa gtatttactg tgatactgaa acactgggaa 360 

attaacactg attaagatat tttaaacact gagtcttaat tataacagaa ccagttttca 42 0 
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tcagaatgct tttacgtcac attcagtgaa gtgttacgct aatatattct acagccctga 4 80 

agatagaaaa aaggtttctc tccaggtatg agat atg gta caa aaa tac att 532 

Met Val Gin Lys Tyr lie 
1 5 

ttt cca cat aca aaa gag aga aaa aaa caa aga cat gtg gcg ggt ggc 580 
Phe Pro His Thr Lys Glu Arg Lys Lys Gin Arg His Val Ala Gly Gly 
10 15 20 

9&9 999 agg ccc aat ccc aac acc eta caa ggt tec atg gaa tgg aga 628 
Glu Gly Arg Pro Asn Pro Asn Thr Leu Gin Gly Ser Met Glu Trp Arg 
25 30 35 

agg aac aaa aaa ate ccc aat tat ttt ggg gta aga tgt gee cca gaa 676 
Arg Asn Lys Lys He Pro Asn Tyr Phe Gly Val Arg Cys Ala Pro Glu 
40 45 50 

aag gtg aaa tct atg caa taa aa cccaggtttt cttcaaatct agcatctagg 729 
Lys Val Lys Ser Met Gin * 
55 60 

atttctatca gagtttcaaa taatcagaat ttctatcaga atttctaccc tgaggtgaca 7 89 

cctactaact gtaggttctt tcattaaaaa tgaagacatc tttcaccaga atgtatcaag 849 

ctataaaact ggcttcagag cctacactta gccagagtgg aaaaaaaaaa tagtgeatat 909 

tttcgacagc a 920 
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<220> 
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<222> (133) . . (1122) 



<400> 35 

gtgatcatcg acgcctgcgg taccggtccg gaattccegg gtcgacccac gcgtccgggc 60 

ggcttcctag tgagtcggcg gctgatttag aaggaggttc aggctaeggt gagecgaage 12 0 

cacacaggag cc atg gaa gtg gca gag ccc age age ccc act gaa gag 168 
Met Glu Val Ala Glu Pro Ser Ser Pro Thr Glu Glu 
15 10 

gag gag gag gaa gag gag cac teg gca gag cct egg ccc cgc act cgc 216 
Glu Glu Glu Glu Glu Glu His Ser Ala Glu Pro Arg Pro Arg Thr Arg 
15 20 25 

tec aat cct gaa ggg get gag gac egg gca gta ggg gca cag gee age 264 
Ser Asn Pro Glu Gly Ala Glu Asp Arg Ala Val Gly Ala Gin Ala Ser 
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30 35 40 

gtg ggc age cgc age gag ggt gag ggt gag gec gec agt get gat gat 312 
Val Gly Ser Arg Ser Glu Gly Glu Gly Glu Ala Ala Ser Ala Asp Asp 
45 50 55 60 

ggg age etc aac act tea gga gee ggc cct aag tec tgg cag gtg ccc 360 
Gly Ser Leu Asn Thr Ser Gly Ala Gly Pro Lys Ser Trp Gin Val Pro 
65 70 75 

ccg cca gec cct gag gtc caa att egg aca cca agg gtc aac tgt cca 40 8 

Pro Pro Ala Pro Glu Val Gin lie Arg Thr Pro Arg Val Asn Cys Pro 
80 85 90 

gag aaa gtg att ate tgc ctg gac ctg tea gag gaa atg tea ctg cca 456 
Glu Lys Val lie lie Cys Leu Asp Leu Ser Glu Glu Met Ser Leu Pro 
95 100 105 

aag ctg gag teg ttc aac ggc tec aaa acc aac gec etc aat gtc tec 504 
Lys Leu Glu Ser Phe Asn Gly Ser Lys Thr Asn Ala Leu Asn Val Ser 
110 115 120 

cag aag atg att gag atg ttc gtg egg aca aaa cac aag ate gac aaa 552 
Gin Lys Met lie Glu Met Phe Val Arg Thr Lys His Lys lie Asp Lys 
125 130 135 140 

age cac gag ttt gca ctg gtg gtg gtg aac gat gac acg gee tgg ctg 60 0 

Ser His Glu Phe Ala Leu Val Val Val Asn Asp Asp Thr Ala Trp Leu 
145 150 155 

tct ggc ctg acc tec gac ccc cgc gag etc tgt age tgc etc tat gat 648 
Ser Gly Leu Thr Ser Asp Pro Arg Glu Leu Cys Ser Cys Leu Tyr Asp 
160 165 170 

ctg gag acg gec tec tgt tec acc ttc aat ctg gaa gga ctt ttc age 696 
Leu Glu Thr Ala Ser Cys Ser Thr Phe Asn Leu Glu Gly Leu Phe Ser 
175 180 185 

etc ate cag cag aaa act gag ctt ccg gtc aca gag aac gtg cag acg 744 
Leu lie Gin Gin Lys Thr Glu Leu Pro Val Thr Glu Asn Val Gin Thr 
190 195 200 

att ccc ccg cca tat gtg gtc cgc acc ate ctt gtc tac age cgt cca 792 
lie Pro Pro Pro Tyr Val Val Arg Thr lie Leu Val Tyr Ser Arg Pro 
205 210 215 220 

cct tgc cag ccc cag ttc tec ttg acg gag ccc atg aag aaa atg ttc 840 
Pro Cys Gin Pro Gin Phe Ser Leu Thr Glu Pro Met Lys Lys Met Phe 
225 230 235 

cag tgc cca tat ttc ttc ttt gac gtt gtt tac ate cac aat ggc act 888 
Gin Cys Pro Tyr Phe Phe Phe Asp Val Val Tyr lie His Asn Gly Thr 
240 245 250 

gag gag aag gag gag gag atg agt tgg aag gat atg ttt gee ttc atg 936 
Glu Glu Lys Glu Glu Glu Met Ser Trp Lys Asp Met Phe Ala Phe Met 
255 260 265 
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ggc age ctg gat acc aag ggt acc age tac aag tat gag gtg gca ctg 984 
Gly Ser Leu Asp Thr Lys Gly Thr Ser Tyr Lys Tyr Glu Val Ala Leu 
270 275 280 

get ggg cca gec ctg gag ttg cac aac tgc atg gcg aaa ctg ttg gec 1032 
Ala Gly Pro Ala Leu Glu Leu His Asn Cys Met Ala Lys Leu Leu Ala 
285 290 295 300 

cac ccc ctg cag egg cct tgc cag age cat get tec tac age ctg ctg 1080 
His Pro Leu Gin Arg Pro Cys Gin Ser His Ala Ser Tyr Ser Leu Leu 
305 310 315 

gag gag gag gat gaa gee att gag gtt gag gec act gtc tga accatcc 1129 
Glu Glu Glu Asp Glu Ala lie Glu Val Glu Ala Thr Val * 
320 325 330 

ctgtacatct gcaccttctt gtgcaaggaa gtccttggcc taaagccttg gttctcaaac 1189 

tgggttcctt gggacctccg gggtgggggg gttccaggag gcat 1233 
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